Audio coding via EMD
hal.structure.identifier | Institut de Recherche de l'Ecole Navale [IRENAV] | |
dc.contributor.author | BOUDRAA, Abdel-Ouahab | |
hal.structure.identifier | Institut de Recherche de l'Ecole Navale [IRENAV] | |
dc.contributor.author | KHALDI, Kais | |
hal.structure.identifier | Département Signal et Communications [IMT Atlantique - SC] | |
hal.structure.identifier | Lab-STICC_IMTA_CID_TOMS | |
dc.contributor.author | CHONAVEL, Thierry | |
dc.contributor.author | TURKI HADJ-ALOUANE, Mounia | |
hal.structure.identifier | Institut de Recherche de l'Ecole Navale [IRENAV] | |
dc.contributor.author | KOMATY, Ali | |
dc.date.accessioned | 2021-05-14T09:33:50Z | |
dc.date.available | 2021-05-14T09:33:50Z | |
dc.date.issued | 2020-09 | |
dc.identifier.issn | 1051-2004 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/76083 | |
dc.description.abstractEn | In this paper an audio coding scheme based on the empirical mode decomposition in association with a psychoacoustic model is presented. The principle of the method consists in breaking down adaptively the audio signal into intrinsic oscillatory components, called Intrinsic Mode Functions (IMFs), that are fully described by their local extrema. These extrema are encoded. The coding is carried out frame by frame and no assumption is made upon the signal to be coded. The number of allocated bits varies from mode to mode and obeys to the coding error inaudibility constraint. Due to the symmetry of an IMF, only the extrema (maxima or minima) of one of its interpolating envelopes are perceptually coded. In addition, to deal with rapidly changing audio signals, a stationarity index is used and when a transient is detected, the frame is split into two overlapping sub-frames. At the decoder side, the IMFs are recovered using the associated coded maxima, and the original signal is reconstructed by IMFs summation. Performance of the proposed coding is analyzed and compared to that of MP3 and AAC codecs, and the wavelet-based coding approach. Based on the analyzed mono audio signals, the obtained results show that the proposed coding scheme outperforms the MP3 and the wavelet-based coding methods and performs slightly better than the AAC codec, showing thus the potential of the EMD for data-driven audio coding. | |
dc.language.iso | en | |
dc.publisher | Elsevier | |
dc.subject.en | Empirical mode decomposition | |
dc.subject.en | Empirical mode compression | |
dc.subject.en | Audio coding | |
dc.subject.en | Sub-band coding | |
dc.subject.en | Stationarity index | |
dc.subject.en | Psychoacoustic model | |
dc.title.en | Audio coding via EMD | |
dc.type | Article de revue | |
dc.identifier.doi | 10.1016/j.dsp.2020.102770 | |
dc.subject.hal | Informatique [cs]/Traitement du signal et de l'image | |
dc.subject.hal | Sciences de l'ingénieur [physics]/Traitement du signal et de l'image | |
bordeaux.journal | Digital Signal Processing | |
bordeaux.page | 102770 | |
bordeaux.volume | 104 | |
bordeaux.hal.laboratories | Institut de Mécanique et d’Ingénierie de Bordeaux (I2M) - UMR 5295 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.institution | INRAE | |
bordeaux.institution | Arts et Métiers | |
bordeaux.peerReviewed | oui | |
hal.identifier | hal-02902533 | |
hal.version | 1 | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-02902533v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Digital%20Signal%20Processing&rft.date=2020-09&rft.volume=104&rft.spage=102770&rft.epage=102770&rft.eissn=1051-2004&rft.issn=1051-2004&rft.au=BOUDRAA,%20Abdel-Ouahab&KHALDI,%20Kais&CHONAVEL,%20Thierry&TURKI%20HADJ-ALOUANE,%20Mounia&KOMATY,%20Ali&rft.genre=article |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |