A machine learning framework for the categorization of elements in images of musical documents

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/135636
Registro completo de metadatos
Registro completo de metadatos
Campo DCValorIdioma
dc.contributorReconocimiento de Formas e Inteligencia Artificiales_ES
dc.contributor.authorCalvo-Zaragoza, Jorge-
dc.contributor.authorVigliensoni, Gabriel-
dc.contributor.authorFujinaga, Ichiro-
dc.contributor.otherUniversidad de Alicante. Departamento de Lenguajes y Sistemas Informáticoses_ES
dc.date.accessioned2023-06-30T09:46:38Z-
dc.date.available2023-06-30T09:46:38Z-
dc.date.issued2017-
dc.identifier.citationCalvo-Zaragoza, Jorge; Vigliensoni, Gabriel; Fujinaga, Ichiro. “A machine learning framework for the categorization of elements in images of musical documents”. In: Lopez Palma, Helena, et al. (Eds.). Proceedings of the Third International Conference on Technologies for Music Notation and Representation TENOR 2017. A Coruña: Universidade da Coruña, Servizo de Publicacións, 2017. ISBN 978-84-9749-666-7, pp. 17-23es_ES
dc.identifier.isbn978-84-9749-666-7-
dc.identifier.urihttp://hdl.handle.net/10045/135636-
dc.description.abstractMusical documents may contain heterogeneous information such as music symbols, text, staff lines, ornaments, annotations, and editorial data. Before any attempt at automatically recognizing the information on scores, it is usually necessary to detect and classify each constituent layer of information into different categories. The greatest obstacle of this classification process is the high heterogeneity among music collections, which makes it difficult to propose methods that can be generalizable to a broad range of sources. In this paper we propose a novel machine learning framework that focuses on extracting the different layers within musical documents by categorizing the image at pixel level. The main advantage of our approach is that it can be used regardless of the type of document provided, as long as training data is available. We illustrate some of the capabilities of the framework by showing examples of common tasks that are frequently performed on images of musical documents, such as binarization, staff-line removal, symbol isolation, and complete layout analysis. All these are tasks for which our approach has shown promising performance. We believe our framework will allow the development of generalizable and scalable automatic music recognition systems, thus facilitating the creation of large-scale browsable and searchable repositories of music documents.es_ES
dc.description.sponsorshipThis work was partially supported by the Social Sciences and Humanities Research Council of Canada and the Spanish Ministerio de Educación, Cultura y Deporte through a FPU Fellowship (Ref. AP2012–0939).es_ES
dc.languageenges_ES
dc.publisherUniversidade da Coruña. Servizo de Publicaciónses_ES
dc.rights© 2017 Jorge Calvo-Zaragoza et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.es_ES
dc.subjectMachine learning frameworkes_ES
dc.subjectMusical documentses_ES
dc.subjectCategorization of elements in imageses_ES
dc.titleA machine learning framework for the categorization of elements in images of musical documentses_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.peerreviewedsies_ES
dc.relation.publisherversionhttp://hdl.handle.net/2183/18494es_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
Aparece en las colecciones:INV - GRFIA - Comunicaciones a Congresos, Conferencias, etc.



Este ítem está licenciado bajo Licencia Creative Commons Creative Commons