MusicFactoryApplication of a Convolutional Neural Network for the Generation of Soundscapes from Images
- Juan José Navarro-Cáceres
- André Sales Mendes
- Hector Sánchez San Blas
- Gabriel Villarrubia González
- María Navarro-Cáceres
- Daniel H. de la Iglesia (ed. lit.)
- Juan F. de Paz Santana (ed. lit.)
- Alfonso J. López Rivero (ed. lit.)
Éditorial: Springer International Publishing AG
ISBN: 978-3-031-14858-3
Année de publication: 2023
Pages: 156-164
Congreso: DiTTEt: International Conference on Disruptive Technologies, Tech Ethics and Artificial Intelligence (2. 2022. Salamanca)
Type: Communication dans un congrès
Résumé
A soundscape is a sound description of a concrete environment. Therefore, the soundscapes are always connected to a visual component, as it might capture sounds from an urban city, a countryside, or a domestic place. In this work, we present a system that generate soundscapes from images. Firstly, we recognize some objects in the image. In a second step the system searches the most adequate sounds according to the entities identified in the picture. Finally, a soundscape is synthesized by combining the short sound files found. The results obtained according to the subjective evaluation are promising and encouraging to deepen our research in the soundscape generation.