Overview of Rest-Mex at IberLEF 2023: Research on Sentiment Analysis Task for Mexican Tourist Texts

Álvarez-Carmona, Miguel Á.; Díaz-Pacheco, Ángel; Aranda, Ramón; Rodríguez-González, Ansel Y.; Muñiz-Sánchez, Victor; López-Monroy, A. Pastor; Sanchez-Vega, Fernando; Bustio-Martínez, Lázaro

Overview of Rest-Mex at IberLEF 2023: Research on Sentiment Analysis Task for Mexican Tourist Texts

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/137204

Información del item - Informació de l'item - Item information
Title:	Overview of Rest-Mex at IberLEF 2023: Research on Sentiment Analysis Task for Mexican Tourist Texts
Other Titles:	Resumen de la tarea Rest-Mex en IberLEF 2023: Investigaci´on sobre An´alisis de Sentimiento para Textos Tur´ısticos Mexicanos
Authors:	Álvarez-Carmona, Miguel Á. \| Díaz-Pacheco, Ángel \| Aranda, Ramón \| Rodríguez-González, Ansel Y. \| Muñiz-Sánchez, Victor \| López-Monroy, A. Pastor \| Sanchez-Vega, Fernando \| Bustio-Martínez, Lázaro
Keywords:	Rest-Mex 2023 \| Sentiment Analysis \| Clustering \| Mexican Tourist Text \| Análisis de sentimientos \| Agrupación de textos \| Textos Turísticos
Issue Date:	Sep-2023
Publisher:	Sociedad Española para el Procesamiento del Lenguaje Natural
Citation:	Procesamiento del Lenguaje Natural. 2023, 71: 425-436. https://doi.org/10.26342/2023-71-34
Abstract:	This paper presents the framework and results of the Rest-Mex task at IberLEF 2023, focusing on sentiment analysis and text clustering of tourist texts. The study primarily focuses on texts related to tourist destinations in Mexico, although this edition included data from Cuba and Colombia for the first time. The sentiment analysis task aims to predict the polarity of opinions expressed by tourists, classifying the type of place visited, whether it’s a tourist attraction, hotel, or restaurant, as well as the country it is located in. On the other hand, the text clustering task aims to classify news articles related to tourism in Mexico. For both tasks, corpora were built using Spanish opinions extracted from TripAdvisor and news articles from Mexican media. This article compares and discusses the results obtained by the participants in both sub-tasks. Additionally, a method is proposed to measure the easiness of a multi-class text classification corpus, along with an approach for system selection in a possible late fusion scheme. \| Este artículo presenta el marco y los resultados de la tarea Rest-Mex en IberLEF 2023, que se enfoca en el análisis de sentimiento y agrupamiento de textos turísticos. El estudio se centra principalmente en textos relacionados con destinos turísticos en México, aunque esta edición incluyó datos de Cuba y Colombia por primera vez. La tarea de análisis de sentimiento tiene como objetivo predecir la polaridad de opiniones expresadas por turistas, clasificando el tipo de lugar visitado, ya sea un atractivo turístico, un hotel o un restaurante, así como el país en el que se encuentra. Por otro lado, la tarea de agrupamiento de textos busca clasificar noticias relacionadas con el turismo en México. Para ambas tareas, se construyeron corpus utilizando opiniones en español extraídas de TripAdvisor y noticias de medios mexicanos. En este artículo, se comparan y discuten los resultados obtenidos por los participantes en ambas sub tareas. Además, se propone un método para medir la facilidad de un corpus de clasificación textual multi-clase, así como un enfoque para la selección de sistemas en un posible esquema de fusión tardía.
Sponsor:	The authors thank the Mexican Academy of Tourism Research (AMIT) for their support of the project ”Creation of a labeled database related to tourist destinations for training artificial intelligence models for classifying relevant topics” through the call ”I Research Projects 2022”, which originated this work.
URI:	http://hdl.handle.net/10045/137204
ISSN:	1135-5948
DOI:	10.26342/2023-71-34
Language:	eng
Type:	info:eu-repo/semantics/article
Rights:	© Sociedad Española para el Procesamiento del Lenguaje Natural. Distribuido bajo Licencia Creative Commons Reconocimiento-NoComercial-SinObraDerivada 4.0
Peer Review:	si
Publisher version:	https://doi.org/10.26342/2023-71-34
Appears in Collections:	Procesamiento del Lenguaje Natural - Nº 71 (2023)

Files in This Item:

Files in This Item:
File	Description	Size	Format
PLN_71_34.pdf		1,14 MB	Adobe PDF	Open Preview Close preview

See citations in Google Scholar

Show full item record

This item is licensed under a Creative Commons License