A Preliminary Study of Few-shot Learning for Layout Analysis of Music Scores
Por favor, use este identificador para citar o enlazar este ítem:
http://hdl.handle.net/10045/138500
Título: | A Preliminary Study of Few-shot Learning for Layout Analysis of Music Scores |
---|---|
Autor/es: | Castellanos, Francisco J. | Gallego, Antonio-Javier | Fujinaga, Ichiro |
Grupo/s de investigación o GITE: | Reconocimiento de Formas e Inteligencia Artificial |
Centro, Departamento o Servicio: | Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos | Universidad de Alicante. Instituto Universitario de Investigación Informática |
Palabras clave: | Few-shot learning | Layout analysis | Music scores | Optical Music Recognition |
Fecha de publicación: | nov-2023 |
Editor: | International Workshop on Reading Music Systems |
Cita bibliográfica: | Castellanos, Francisco J.; Gallego, Antonio Javier; Fujinaga, Ichiro. “A Preliminary Study of Few-shot Learning for Layout Analysis of Music Scores”. In: Calvo-Zaragoza, Jorge; Pacha, Alexander; Shatri, Elona (Eds.). Proceedings of the 5th International Workshop on Reading Music Systems: 4th November, 2023, Milan, Italy, pp. 44-48 |
Resumen: | Few-shot techniques offer a promising avenue to reduce the high demand for annotated data required by current machine learning-based applications, such as Optical Music Recognition (OMR). This is a field dedicated to the automatic transcription of music notation from sheet music images. Traditional OMR systems strongly depend on layout analysis, a crucial step involving the identification and segmentation of several components within a music score, such as staff lines, text, or notes. The standard approach requires extensive fully annotated training data, which are resource-intensive and time-consuming to label and curate by domain experts. We present a preliminary study on the use of few-shot learning to alleviate the disadvantages associated with manual annotations. The proposal minimizes the human effort required by employing only partial annotations. For this, we introduce an oversampling technique to train models using a limited set of annotated patches extracted from the score images. Our experimental findings, conducted on four benchmark datasets, underscore the efficacy of the proposed patch extraction. Despite operating with a reduced amount of annotated data, our method achieves performance levels competitive with models trained on the complete dataset. This work points out the potential of few-shot learning in the context of layout analysis for music scores, offering the promise of more efficient and accessible OMR systems. |
Patrocinador/es: | This research was supported by the I+D+i project TED2021-132103A-I00 (DOREMI), funded by MCIN/AEI/10.13039/501100011033, the Social Sciences and Humanities Research Council (895-2013-1012) and the Fonds de recherche du Québec-Société et Culture (2022-SE3-303927). |
URI: | http://hdl.handle.net/10045/138500 |
Idioma: | eng |
Tipo: | info:eu-repo/semantics/conferenceObject |
Derechos: | © The respective authors. Licensed under a Creative Commons Attribution 4.0 International License (CC-BY-4.0). |
Revisión científica: | si |
Versión del editor: | https://doi.org/10.48550/arXiv.2311.04091 |
Aparece en las colecciones: | INV - GRFIA - Comunicaciones a Congresos, Conferencias, etc. |
Archivos en este ítem:
Archivo | Descripción | Tamaño | Formato | |
---|---|---|---|---|
Castellanos_etal_Proceedings-5th-International-Workshop-on-Reading-Music-Systems.pdf | 1,45 MB | Adobe PDF | Abrir Vista previa | |
Todos los documentos en RUA están protegidos por derechos de autor. Algunos derechos reservados.