End-To-End Full-Page Optical Music Recognition of Monophonic Documents via Score Unfolding

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/130018
Información del item - Informació de l'item - Item information
Title: End-To-End Full-Page Optical Music Recognition of Monophonic Documents via Score Unfolding
Authors: Ríos-Vila, Antonio | Iñesta, José M. | Calvo-Zaragoza, Jorge
Research Group/s: Reconocimiento de Formas e Inteligencia Artificial
Center, Department or Service: Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Keywords: Optical Music Recognition | Full Page | Monophonic Documents | Score Unfolding
Issue Date: Nov-2022
Publisher: Workshop on Reading Music Systems
Citation: Ríos-Vila, Antonio; Iñesta, José M.; Calvo-Zaragoza, Jorge. “End-To-End Full-Page Optical Music Recognition of Monophonic Documents via Score Unfolding”. In: Calvo-Zaragoza, Jorge; Pacha, Alexander; Shatri, Elona (Eds.). Proceedings of the 4th International Workshop on Reading Music Systems, 18th November, 2022, pp. 20-24
Abstract: Full Page Optical Music Recognition (OMR) systems typically consist of multi-step workflows. However, the fine-tuning of these systems tends to be costly. We present the first layout analysis-free full-page OMR model that receives a page image and directly outputs its transcription in a single step. This model requires only the annotations of full score pages during training. The model has been tested with early-notation monophonic music scores, for which the presented approach is especially beneficial. Results show that this methodology provides a solution with promising results and establishes a new line of research for end-to-end music transcription.
Sponsor: This paper is part of the project MultiScore (PID2020-118447RA-I00), funded by MCIN/AEI/10.13039/ 501100011033. The first author is supported by grant ACIF/2021/356 from “Programa I+D+i de la Generalitat Valenciana”. Third author was supported with a 2021 Leonardo Grant for Researchers and Cultural Creators, BBVA Foundation.
URI: http://hdl.handle.net/10045/130018
Language: eng
Type: info:eu-repo/semantics/conferenceObject
Rights: © The respective authors. Licensed under a Creative Commons Attribution 4.0 International License (CC-BY-4.0).
Peer Review: si
Appears in Collections:INV - GRFIA - Comunicaciones a Congresos, Conferencias, etc.

Files in This Item:
Files in This Item:
File Description SizeFormat 
ThumbnailEnd-To-End-Full-Page-Optical-Music-Recognition.pdf2,65 MBAdobe PDFOpen Preview


This item is licensed under a Creative Commons License Creative Commons