Predictive models on 1D signals in a small-data environment
Langue
en
Rapport
Ce document a été publié dans
2021-04-28
Résumé en anglais
This report is concerned by one project done during the Semaine d'Études Mathématiques et Entreprises (SEME) at the Institut de Mathématiques de Bordeaux. The subject, proposed by the company FieldBox.ai, concerns the use ...Lire la suite >
This report is concerned by one project done during the Semaine d'Études Mathématiques et Entreprises (SEME) at the Institut de Mathématiques de Bordeaux. The subject, proposed by the company FieldBox.ai, concerns the use of machine learning algorithms and data augmentation techniques, applied to small datasets composed of 1D signals measurements. The target variable is supposed to be continuous, i.e., a regression problem. By first reviewing the literature and existing methods on data augmentation, we propose two procedures to tackle this problem: one allows to create synthetic observations for a specific range of target values and it is based on a perturbation method in Fourier/Wavelet space; the other is based on neural networks and uses a particular version of the Variational Autoencoder known as LSTM-VAE. Our methods are applied to an open dataset, available at the UCI repository, and show encouraging results for a common class of machine learning algorithms.< Réduire
Mots clés en anglais
small-data
machine learning
inferring techniques
data augmentation
imputation
Variational Autoencoder
LSTM
Fast Fourier Transform
Discrete Wavelet Transform
time series
class imbalance
Origine
Importé de halUnités de recherche