Speech enhancement using empirical mode decomposition and the Teager–Kaiser energy operator
hal.structure.identifier | Ecole Nationale d'Ingénieurs de Tunis [ENIT] | |
dc.contributor.author | KHALDI, Kais | |
hal.structure.identifier | Institut de Recherche de l'Ecole Navale [IRENAV] | |
dc.contributor.author | BOUDRAA, Abdel-Ouahab | |
hal.structure.identifier | Institut de Recherche de l'Ecole Navale [IRENAV] | |
dc.contributor.author | KOMATY, Ali | |
dc.date.accessioned | 2021-05-14T09:58:47Z | |
dc.date.available | 2021-05-14T09:58:47Z | |
dc.date.issued | 2014-01 | |
dc.identifier.issn | 0001-4966 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/77976 | |
dc.description.abstractEn | In this paper, a speech denoising strategy based on time-adaptive thresholding of the intrinsic mode functions (IMFs) of the signal, extracted by empirical mode decomposition (EMD), is introduced. The denoised signal is reconstructed by superposition of its adaptively thresholded IMFs. Adaptive thresholds are estimated using the Teager–Kaiser energy operator (TKEO) of the signal IMFs. More precisely, TKEO identifies the type of frame by expanding differences between speech and non-speech frames in each IMF. Being based on EMD, the proposed speech denoising scheme is a fully data-driven approach. The method is tested on speech signals with different noise levels, and the results are compared to EMD-shrinkage and to the wavelet transform (WT) coupled with TKEO. Speech enhancement performance is evaluated using the output signal-to-noise ratio (SNR) and the perceptual evaluation of speech quality (PESQ) measure. Based on the analyzed speech signals, the proposed enhancement scheme performs better than the WT-TKEO and EMD-shrinkage approaches in terms of output SNR and PESQ. Noise is reduced more by time-adaptive thresholding than by universal thresholding. The study is limited to signals corrupted by additive white Gaussian noise. | |
dc.language.iso | en | |
dc.publisher | Acoustical Society of America | |
dc.subject | Interpolation | |
dc.subject | Speech signal | |
dc.subject | Teager-Kaiser operator | |
dc.subject | Speech | |
dc.subject | Speech analysis | |
dc.subject | Wavelets | |
dc.subject | Empirical mode decomposition | |
dc.subject | Noise propagation | |
dc.title.en | Speech enhancement using empirical mode decomposition and the Teager–Kaiser energy operator | |
dc.type | Journal article | |
dc.identifier.doi | 10.1121/1.4837835 | |
dc.subject.hal | Engineering sciences [physics]/Acoustics [physics.class-ph] | |
dc.subject.hal | Engineering sciences [physics]/Signal and image processing | |
bordeaux.journal | Journal of the Acoustical Society of America | |
bordeaux.page | 451-459 | |
bordeaux.volume | 135 | |
bordeaux.hal.laboratories | Institut de Mécanique et d’Ingénierie de Bordeaux (I2M) - UMR 5295 | * |
bordeaux.issue | 1 | |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.institution | INRAE | |
bordeaux.institution | Arts et Métiers | |
bordeaux.peerReviewed | yes | |
hal.identifier | hal-01084175 | |
hal.version | 1 | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-01084175v1 | |