A holistic approach for image-to-graph: application to optical music recognition

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/127199
Información del item - Informació de l'item - Item information
Título: A holistic approach for image-to-graph: application to optical music recognition
Autor/es: Garrido Muñoz, Carlos | Ríos-Vila, Antonio | Calvo-Zaragoza, Jorge
Grupo/s de investigación o GITE: Reconocimiento de Formas e Inteligencia Artificial
Centro, Departamento o Servicio: Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos | Universidad de Alicante. Instituto Universitario de Investigación Informática
Palabras clave: Optical music recognition | Graph representation | Deep learning
Fecha de publicación: 16-sep-2022
Editor: Springer Nature
Cita bibliográfica: International Journal on Document Analysis and Recognition (IJDAR). 2022, 25: 293-303. https://doi.org/10.1007/s10032-022-00417-4
Resumen: A number of applications would benefit from neural approaches that are capable of generating graphs from images in an end-to-end fashion. One of these fields is optical music recognition (OMR), which focuses on the computational reading of music notation from document images. Given that music notation can be expressed as a graph, the aforementioned approach represents a promising solution for OMR. In this work, we propose a new neural architecture that retrieves a certain representation of a graph—identified by a specific order of its vertices—in an end-to-end manner. This architecture works by means of a double output: It sequentially predicts the possible categories of the vertices, along with the edges between each of their pairs. The experiments carried out prove the effectiveness of our proposal as regards retrieving graph structures from excerpts of handwritten musical notation. Our results also show that certain design decisions, such as the choice of graph representations, play a fundamental role in the performance of this approach.
Patrocinador/es: Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. Work produced with the support of a 2021 Leonardo Grant for Researchers and Cultural Creators, BBVA Foundation. The Foundation takes no responsibility for the opinions, statements and contents of this project, which are entirely the responsibility of its authors. The second author is supported by grant ACIF/2021/356 from the “Programa I+D+i de la Generalitat Valenciana”.
URI: http://hdl.handle.net/10045/127199
ISSN: 1433-2833 (Print) | 1433-2825 (Online)
DOI: 10.1007/s10032-022-00417-4
Idioma: eng
Tipo: info:eu-repo/semantics/article
Derechos: © The Author(s) 2022. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Revisión científica: si
Versión del editor: https://doi.org/10.1007/s10032-022-00417-4
Aparece en las colecciones:INV - GRFIA - Artículos de Revistas

Archivos en este ítem:
Archivos en este ítem:
Archivo Descripción TamañoFormato 
ThumbnailGarrido-Munoz_etal_2022_IJDAR.pdf1,01 MBAdobe PDFAbrir Vista previa


Este ítem está licenciado bajo Licencia Creative Commons Creative Commons