Please use this identifier to cite or link to this item:
Title: Classification of multiple sound events in a single frame using generative adversarial networks
metadata.dc.creator: Rodrigo González Huerta
Keywords: Ingeniería y Tecnología;Ciencias Tecnológicas;Otras especialidades tecnológicas 1-Jun-2023
Publisher: Ingeniería
Description: Environmental sound classification is a computational task that belongs to the branch of artificial intelligence called sound recognition. Several techniques and different approaches exist to tackle this task; one that yields excellent results is through the utilization of deep learning techniques, i.e., neural networks. Despite their good results, neural networks in some cases fail to generalize well to new data when the amount of training data is scarce. This can lead to a phenomena called overfitting. A solution to this inconvenience is based on the use of deep generative models to generate synthetic data through the approximation of high-dimensional probability distributions. This allow to generate new samples, similar to the ones used to train the generative model. Generative Adversarial Networks (GANs) are a kind of generative model which trains two neural networks simultaneously in an adversarial way, i.e., pitting one against the other. In this work it is shown the effect of using GANs as data augmentation technique that could be used to improve the performance of different sound classification models.
Appears in Collections:Maestría en Ciencias en Inteligencia Artificial

Files in This Item:
File Description SizeFormat 
RI007523.pdf6.46 MBAdobe PDFThumbnail

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.