HHT-based audio coding
hal.structure.identifier | Institut de Recherche de l'Ecole Navale [IRENAV] | |
dc.contributor.author | KHALDI, Kais | |
hal.structure.identifier | Institut de Recherche de l'Ecole Navale [IRENAV] | |
dc.contributor.author | BOUDRAA, Abdel-Ouahab | |
hal.structure.identifier | Laboratoire d'Analyse, Topologie, Probabilités [LATP] | |
dc.contributor.author | TORRÉSANI, Bruno | |
hal.structure.identifier | Département Signal et Communications [SC] | |
hal.structure.identifier | Lab-STICC_TB_CID_TOMS | |
dc.contributor.author | CHONAVEL, Thierry | |
dc.date.accessioned | 2021-05-14T10:03:34Z | |
dc.date.available | 2021-05-14T10:03:34Z | |
dc.date.created | 2012 | |
dc.date.issued | 2013 | |
dc.identifier.issn | 1863-1703 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/78416 | |
dc.description.abstractEn | In this paper, a new audio coding scheme combining the Hilbert transform and the Empirical Mode Decomposition (EMD) is introduced. Because it is based on the EMD, the coding is a fully data-driven approach. The audio signal is first decomposed adaptively, by EMD, into intrinsic oscillatory components called Intrinsic Mode Functions (IMFs). The key idea of this work is to code both the instantaneous amplitude (IA) and the instantaneous frequency (IF) of the extracted IMFs, calculated using the Hilbert transform. Since the IA (resp. IF) is strongly correlated, it is encoded via a linear prediction technique. The decoder recovers the original signal by superposition of the demodulated IMFs. The proposed approach is applied to audio signals, and the results are compared to those obtained by the AAC (Advanced Audio Coding) and MP3 codecs and by wavelet-based compression. Coding performance is evaluated using the bit rate, Objective Difference Grade (ODG), and Noise to Mask Ratio (NMR) measures. Based on the analyzed audio signals, overall, our coding scheme performs better than wavelet compression and the AAC and MP3 codecs. Results also show that this new scheme has good coding performance without significant perceptual distortion, resulting in an ODG in the range [-1,0] and large negative NMR values. | |
dc.language.iso | en | |
dc.publisher | Springer Verlag | |
dc.subject.en | Hilbert-Huang transform | |
dc.subject.en | Hilbert transform | |
dc.subject.en | Intrinsic mode function | |
dc.subject.en | Linear prediction | |
dc.subject.en | Audio coding | |
dc.subject.en | Empirical mode decomposition | |
dc.title.en | HHT-based audio coding | |
dc.type | Article de revue | |
dc.identifier.doi | 10.1007/s11760-013-0433-6 | |
dc.subject.hal | Sciences de l'ingénieur [physics]/Traitement du signal et de l'image | |
dc.subject.hal | Informatique [cs]/Traitement du signal et de l'image | |
bordeaux.journal | Signal, Image and Video Processing | |
bordeaux.page | 1-9 | |
bordeaux.volume | 7 | |
bordeaux.hal.laboratories | Institut de Mécanique et d’Ingénierie de Bordeaux (I2M) - UMR 5295 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.institution | INRAE | |
bordeaux.institution | Arts et Métiers | |
bordeaux.peerReviewed | oui | |
hal.identifier | hal-00818033 | |
hal.version | 1 | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-00818033v1 | |
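
The abstract states that the instantaneous amplitude (IA) and instantaneous frequency (IF) of each IMF are strongly correlated and are therefore encoded via linear prediction. The sketch below illustrates that general idea only; it is not the authors' implementation. The synthetic IA envelope, the predictor order of 2, the autocorrelation (Levinson-Durbin) method, and all function names are assumptions chosen for illustration, and quantization and entropy coding of the residual are omitted.

```python
import math


def autocorrelation(x, max_lag):
    """Autocorrelation r[0..max_lag] of the sequence x (zero-padded ends)."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Return predictor coefficients a so that x[n] ~ sum(a[k] * x[n-1-k])."""
    a = [0.0] * order
    err = r[0]
    for i in range(order):
        if err <= 0.0:
            break  # degenerate (all-zero) input: keep remaining coefficients at 0
        # Reflection coefficient for model order i + 1
        k = (r[i + 1] - sum(a[j] * r[i - j] for j in range(i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(i):
            new_a[j] = a[j] - k * a[i - 1 - j]
        a = new_a
        err *= 1.0 - k * k
    return a


def lp_residual(x, a):
    """Encoder side: prediction error, using zeros for missing history."""
    res = []
    for n in range(len(x)):
        pred = sum(a[k] * x[n - 1 - k] for k in range(len(a)) if n - 1 - k >= 0)
        res.append(x[n] - pred)
    return res


def lp_reconstruct(res, a):
    """Decoder side: rebuild the sequence from the residual and coefficients."""
    x = []
    for n in range(len(res)):
        pred = sum(a[k] * x[n - 1 - k] for k in range(len(a)) if n - 1 - k >= 0)
        x.append(res[n] + pred)
    return x


# Hypothetical slowly varying instantaneous amplitude (not data from the paper)
ia = [1.0 + 0.5 * math.sin(2.0 * math.pi * n / 200.0) for n in range(400)]
order = 2  # assumed predictor order
coeffs = levinson_durbin(autocorrelation(ia, order), order)
residual = lp_residual(ia, coeffs)
decoded = lp_reconstruct(residual, coeffs)

signal_energy = sum(v * v for v in ia)
residual_energy = sum(v * v for v in residual)
print(f"residual/signal energy ratio: {residual_energy / signal_energy:.4f}")
```

Because the envelope varies slowly, the residual carries far less energy than the envelope itself, which is why a correlated IA or IF track is a natural candidate for predictive coding. Without quantization the decoder reproduces the input up to floating-point error; a real codec would quantize the residual and trade bit rate against distortion.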