MusicFactoryApplication of a Convolutional Neural Network for the Generation of Soundscapes from Images

  1. Juan José Navarro-Cáceres
  2. André Sales Mendes
  3. Hector Sánchez San Blas
  4. Gabriel Villarrubia González
  5. María Navarro-Cáceres
Libro:
New Trends in Disruptive Technologies, Tech Ethics and Artificial Intelligence: The DITTET 2022 Collection
  1. Daniel H. de la Iglesia (ed. lit.)
  2. Juan F. de Paz Santana (ed. lit.)
  3. Alfonso J. López Rivero (ed. lit.)

Editorial: Springer International Publishing AG

ISBN: 978-3-031-14858-3

Año de publicación: 2023

Páginas: 156-164

Congreso: DiTTEt: International Conference on Disruptive Technologies, Tech Ethics and Artificial Intelligence (2. 2022. Salamanca)

Tipo: Aportación congreso

Resumen

A soundscape is a sound description of a concrete environment. Therefore, the soundscapes are always connected to a visual component, as it might capture sounds from an urban city, a countryside, or a domestic place. In this work, we present a system that generate soundscapes from images. Firstly, we recognize some objects in the image. In a second step the system searches the most adequate sounds according to the entities identified in the picture. Finally, a soundscape is synthesized by combining the short sound files found. The results obtained according to the subjective evaluation are promising and encouraging to deepen our research in the soundscape generation.