Rotations Are All You Need: A Generic Method For End-To-End Optical Music Recognition

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/138493
Item information
Title: Rotations Are All You Need: A Generic Method For End-To-End Optical Music Recognition
Author(s): Ríos-Vila, Antonio
Research group(s) or GITE: Reconocimiento de Formas e Inteligencia Artificial
Center, Department or Service: Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Keywords: Optical Music Recognition | Monophonic Scores | Polyphonic Scores | Aligned Music Notation & Lyrics Transcription | Score Unfolding | Rotations
Publication date: Nov-2023
Publisher: International Workshop on Reading Music Systems
Bibliographic citation: Ríos-Vila, Antonio. “Rotations Are All You Need: A Generic Method For End-To-End Optical Music Recognition”. In: Calvo-Zaragoza, Jorge; Pacha, Alexander; Shatri, Elona (Eds.). Proceedings of the 5th International Workshop on Reading Music Systems: 4th November, 2023, Milan, Italy, pp. 34-38
Abstract: End-to-end Optical Music Recognition traditionally involves multi-step processes to address complex documents, where single stave isolation is performed. These state-of-the-art methods often fall short in handling diverse music textures, such as polyphony or Aligned Music Notation and Lyrics Transcription (AMNLT). We introduce a generic end-to-end OMR approach compatible with monophonic, polyphonic, and AMNLT systems. By leveraging score rotations and multi-line transcription unfolding, this model only requires input system annotations during training. Experimental evidence suggests this approach offers a competitive solution with encouraging outcomes, paving the way for future end-to-end music transcription research.
Sponsor(s): This paper is part of the project MultiScore (PID2020-118447RA-I00), funded by MCIN/AEI/10.13039/501100011033. The author is supported by grant ACIF/2021/356 from “Programa I+D+i de la Generalitat Valenciana”.
URI: http://hdl.handle.net/10045/138493
Language: eng
Type: info:eu-repo/semantics/conferenceObject
Rights: © The respective authors. Licensed under a Creative Commons Attribution 4.0 International License (CC-BY-4.0).
Peer reviewed: yes
Publisher version: https://doi.org/10.48550/arXiv.2311.04091
Appears in collections: INV - GRFIA - Comunicaciones a Congresos, Conferencias, etc.

Files in this item:
File | Description | Size | Format
Rios-Vila_Proceedings-5th-International-Workshop-on-Reading-Music-Systems.pdf | | 2.06 MB | Adobe PDF


All documents in RUA are protected by copyright. Some rights reserved.