Decoupling music notation to improve end-to-end Optical Music Recognition

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/123164
Registro completo de metadatos
Registro completo de metadatos
Campo DCValorIdioma
dc.contributorReconocimiento de Formas e Inteligencia Artificiales_ES
dc.contributor.authorAlfaro-Contreras, María-
dc.contributor.authorRíos-Vila, Antonio-
dc.contributor.authorValero-Mas, Jose J.-
dc.contributor.authorIñesta, José M.-
dc.contributor.authorCalvo-Zaragoza, Jorge-
dc.contributor.otherUniversidad de Alicante. Departamento de Lenguajes y Sistemas Informáticoses_ES
dc.contributor.otherUniversidad de Alicante. Instituto Universitario de Investigación Informáticaes_ES
dc.date.accessioned2022-04-26T11:17:08Z-
dc.date.available2022-04-26T11:17:08Z-
dc.date.issued2022-04-26-
dc.identifier.citationPattern Recognition Letters. 2022, 158: 157-163. https://doi.org/10.1016/j.patrec.2022.04.032es_ES
dc.identifier.issn0167-8655 (Print)-
dc.identifier.issn1872-7344 (Online)-
dc.identifier.urihttp://hdl.handle.net/10045/123164-
dc.description.abstractInspired by the Text Recognition field, end-to-end schemes based on Convolutional Recurrent Neural Networks (CRNN) trained with the Connectionist Temporal Classification (CTC) loss function are considered one of the current state-of-the-art techniques for staff-level Optical Music Recognition (OMR). Unlike text symbols, music-notation elements may be defined as a combination of (i) a shape primitive located in (ii) a certain position in a staff. However, this double nature is generally neglected in the learning process, as each combination is treated as a single token. In this work, we study whether exploiting such particularity of music notation actually benefits the recognition performance and, if so, which approach is the most appropriate. For that, we thoroughly review existing specific approaches that explore this premise and propose different combinations of them. Furthermore, considering the limitations observed in such approaches, a novel decoding strategy specifically designed for OMR is proposed. The results obtained with four different corpora of historical manuscripts show the relevance of leveraging this double nature of music notation since it outperforms the standard approaches where it is ignored. In addition, the proposed decoding leads to significant reductions in the error rates with respect to the other cases.es_ES
dc.description.sponsorshipThis paper is part of the project I+D+i PID2020-118447RA-I00 (MultiScore), funded by MCIN/AEI/10.13039/501100011033. The first author is supported by grant FPU19/04957 from the Spanish Ministerio de Universidades. The second author is supported by grant ACIF/2021/356 from “Programa I+D+i de la Generalitat Valenciana“. The third author is supported by grant APOSTD/2020/256 from “Programa I+D+i de la Generalitat Valenciana”.es_ES
dc.languageenges_ES
dc.publisherElsevieres_ES
dc.rights© 2022 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).es_ES
dc.subjectOptical Music Recognitiones_ES
dc.subjectDeep Learninges_ES
dc.subjectConnectionist Temporal Classificationes_ES
dc.subjectSequence Labelinges_ES
dc.subject.otherLenguajes y Sistemas Informáticoses_ES
dc.titleDecoupling music notation to improve end-to-end Optical Music Recognitiones_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.peerreviewedsies_ES
dc.identifier.doi10.1016/j.patrec.2022.04.032-
dc.relation.publisherversionhttps://doi.org/10.1016/j.patrec.2022.04.032es_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-118447RA-I00es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/MICINN//FPU19%2F04957es_ES
Aparece en las colecciones:INV - GRFIA - Artículos de Revistas

Archivos en este ítem:
Archivos en este ítem:
Archivo Descripción TamañoFormato 
ThumbnailAlfaro-Contreras_etal_2022_PatternRecognLett.pdf1,07 MBAdobe PDFAbrir Vista previa


Este ítem está licenciado bajo Licencia Creative Commons Creative Commons