Classification of multiple sound events in a single frame using generative adversarial networks

Rodrigo González Huerta

Por favor, use este identificador para citar o enlazar este ítem: https://ri-ng.uaq.mx/handle/123456789/8442

Título :	Classification of multiple sound events in a single frame using generative adversarial networks
Autor(es):	Rodrigo González Huerta
Palabras clave:	Ingeniería y Tecnología Ciencias Tecnológicas Otras especialidades tecnológicas
Fecha de publicación :	1-jun-2023
Editorial :	Ingeniería
Facultad:	Facultad de Ingeniería
Programa académico:	Maestría en Ciencias en Inteligencia Artificial
Resumen:	Environmental sound classification is a computational task that belongs to the branch of artificial intelligence called sound recognition. Several techniques and different approaches exist to tackle this task; one that yields excellent results is through the utilization of deep learning techniques, i.e., neural networks. Despite their good results, neural networks in some cases fail to generalize well to new data when the amount of training data is scarce. This can lead to a phenomena called overfitting. A solution to this inconvenience is based on the use of deep generative models to generate synthetic data through the approximation of high-dimensional probability distributions. This allow to generate new samples, similar to the ones used to train the generative model. Generative Adversarial Networks (GANs) are a kind of generative model which trains two neural networks simultaneously in an adversarial way, i.e., pitting one against the other. In this work it is shown the effect of using GANs as data augmentation technique that could be used to improve the performance of different sound classification models.
URI:	https://ri-ng.uaq.mx/handle/123456789/8442
Aparece en:	Maestría en Ciencias en Inteligencia Artificial

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
RI007523.pdf		6.46 MB	Adobe PDF	Visualizar/Abrir

Mostrar el registro Dublin Core completo del ítem