Vocal Boost: Software para PC para ajustar el volumen de los diálogos en tiempo real usando inteligencia artificial
Fecha
2024-10-16
Autores
Título de la revista
ISSN de la revista
Título del volumen
Editor
Resumen
[ES] El presente Trabajo de Fin de Grado (TFG) se centra en el desarrollo de un software para PC en Linux que permite regular el volumen de los diálogos de forma independiente al resto de sonidos, enfocado principalmente en películas o videos. Se ha utilizado la red neuronal profunda DEMUCS para separar la voz del resto de audio y se ha adaptado la misma para su funcionamiento en tiempo real. El software se ha integrado en una aplicación con interfaz gráfica para el sistema de sonido, utilizando herramientas como PulseAudio y Sounddevice. El resultado es una aplicación que permite al usuario ajustar el volumen de los diálogos de manera independiente al resto de sonidos en tiempo real.
[EN] This Final Degree Project (TFG) focuses on the development of software for PC on Linux that allows users to independently adjust the volume of dialogues separate from other sounds, primarily in movies or videos. The deep neural network DEMUCS was used to separate the voice from the rest of the audio and has been adapted for realtime operation. The software has been integrated into an application with a graphical interface for the sound system, utilizing tools like PulseAudio and Sounddevice. The result is an application that enables users to adjust the volume of dialogues independently from other sounds in real-time.
[EN] This Final Degree Project (TFG) focuses on the development of software for PC on Linux that allows users to independently adjust the volume of dialogues separate from other sounds, primarily in movies or videos. The deep neural network DEMUCS was used to separate the voice from the rest of the audio and has been adapted for realtime operation. The software has been integrated into an application with a graphical interface for the sound system, utilizing tools like PulseAudio and Sounddevice. The result is an application that enables users to adjust the volume of dialogues independently from other sounds in real-time.
Descripción
Palabras clave
Sistemas de Telecomunicación e Imagen y sonido