Rotations Are All You Need: A Generic Method For End-To-End Optical Music Recognition

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/138493
Información del item - Informació de l'item - Item information
Title: Rotations Are All You Need: A Generic Method For End-To-End Optical Music Recognition
Authors: Ríos-Vila, Antonio
Research Group/s: Reconocimiento de Formas e Inteligencia Artificial
Center, Department or Service: Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Keywords: Optical Music Recognition | Monophonic Scores | Polyphonic Scores | Aligned Music Notation & Lyrics Transcription | Score Unfolding | Rotations
Issue Date: Nov-2023
Publisher: International Workshop on Reading Music Systems
Citation: Ríos-Vila, Antonio. “Rotations Are All You Need: A Generic Method For End-To-End Optical Music Recognition”. In: Calvo-Zaragoza, Jorge; Pacha, Alexander; Shatri, Elona (Eds.). Proceedings of the 5th International Workshop on Reading Music Systems: 4th November, 2023, Milan, Italy, pp. 34-38
Abstract: End-to-end Optical Music Recognition traditionally involves multi-step processes to address complex documents, where single stave isolation is performed. These state-of-the art methods often fall short in handling diverse music textures, such as polyphony or Aligned Music Notation and Lyrics Transcription (AMNLT). We introduce a generic end-to-end OMR approach compatible with monophonic, polyphonic, and AMNLT systems. By leveraging score rotations and multi-line transcription unfolding, this model only requires input system annotations during training. Experimental evidence suggests this approach offers a competitive solution with encouraging outcomes, paving the way for future end-to-end music transcription research.
Sponsor: This paper is part of the project MultiScore (PID2020-118447RA-I00), funded by MCIN/AEI/10.13039/ 501100011033. The author is supported by grant ACIF/2021/356 from “Programa I+D+i de la Generalitat Valenciana”.
URI: http://hdl.handle.net/10045/138493
Language: eng
Type: info:eu-repo/semantics/conferenceObject
Rights: © The respective authors. Licensed under a Creative Commons Attribution 4.0 International License (CC-BY-4.0).
Peer Review: si
Publisher version: https://doi.org/10.48550/arXiv.2311.04091
Appears in Collections:INV - GRFIA - Comunicaciones a Congresos, Conferencias, etc.

Files in This Item:


Items in RUA are protected by copyright, with all rights reserved, unless otherwise indicated.