Using Large Language Models to Enhance the Reusability of Sensor Data

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/139617
Registro completo de metadatos
Registro completo de metadatos
Campo DCValorIdioma
dc.contributorWeb and Knowledge (WaKe)es_ES
dc.contributorProcesamiento del Lenguaje y Sistemas de Información (GPLSI)es_ES
dc.contributor.authorBerenguer, Alberto-
dc.contributor.authorMorejón, Adriana-
dc.contributor.authorTomás, David-
dc.contributor.authorMazón, Jose-Norberto-
dc.contributor.otherUniversidad de Alicante. Departamento de Lenguajes y Sistemas Informáticoses_ES
dc.date.accessioned2024-01-10T08:44:06Z-
dc.date.available2024-01-10T08:44:06Z-
dc.date.issued2024-01-06-
dc.identifier.citationBerenguer A, Morejón A, Tomás D, Mazón J-N. Using Large Language Models to Enhance the Reusability of Sensor Data. Sensors. 2024; 24(2):347. https://doi.org/10.3390/s24020347es_ES
dc.identifier.issn1424-8220-
dc.identifier.urihttp://hdl.handle.net/10045/139617-
dc.description.abstractThe Internet of Things generates vast data volumes via diverse sensors, yet its potential remains unexploited for innovative data-driven products and services. Limitations arise from sensor-dependent data handling by manufacturers and user companies, hindering third-party access and comprehension. Initiatives like the European Data Act aim to enable high-quality access to sensor-generated data by regulating accuracy, completeness, and relevance while respecting intellectual property rights. Despite data availability, interoperability challenges impede sensor data reusability. For instance, sensor data shared in HTML formats requires an intricate, time-consuming processing to attain reusable formats like JSON or XML. This study introduces a methodology aimed at converting raw sensor data extracted from web portals into structured formats, thereby enhancing data reusability. The approach utilises large language models to derive structured formats from sensor data initially presented in non-interoperable formats. The effectiveness of these language models was assessed through quantitative and qualitative evaluations in a use case involving meteorological data. In the proposed experiments, GPT-4, the best performing LLM tested, demonstrated the feasibility of this methodology, achieving a precision of 93.51% and a recall of 85.33% in converting HTML to JSON/XML, thus confirming its potential in obtaining reusable sensor data.es_ES
dc.description.sponsorshipThis research was funded by MCIN/AEI/10.13039/501100011033 and by the European Union Next Generation EU/PRTR as part of the projects TED2021130890B-C21 and PID2021-122263OB-C22. Alberto Berenguer has a contract for predoctoral training with “Generalitat Valenciana” and the European Social Fund, funded by grant number ACIF/2021/507.es_ES
dc.languageenges_ES
dc.publisherMDPIes_ES
dc.rights© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).es_ES
dc.subjectInternet of Thingses_ES
dc.subjectSensor dataes_ES
dc.subjectInteroperabilityes_ES
dc.subjectData reusabilityes_ES
dc.subjectData processinges_ES
dc.titleUsing Large Language Models to Enhance the Reusability of Sensor Dataes_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.peerreviewedsies_ES
dc.identifier.doi10.3390/s24020347-
dc.relation.publisherversionhttps://doi.org/10.3390/s24020347es_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/TED2021-130890B-C21es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2021-122263OB-C22es_ES
Aparece en las colecciones:INV - GPLSI - Artículos de Revistas
INV - WaKe - Artículos de Revistas

Archivos en este ítem:
Archivos en este ítem:
Archivo Descripción TamañoFormato 
ThumbnailBerenguer_etal_2024_Sensors.pdf453,59 kBAdobe PDFAbrir Vista previa


Todos los documentos en RUA están protegidos por derechos de autor. Algunos derechos reservados.