Automatic counter-narrative generation for hate speech in Spanish

Vallecillo-Rodríguez, M. Estrella; Montejo Ráez, Arturo; Martín Valdivia, María Teresa

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/137174

Item information
Title:	Automatic counter-narrative generation for hate speech in Spanish
Other Titles:	Generación automática de contranarrativas para discursos de odio en español
Authors:	Vallecillo-Rodríguez, M. Estrella \| Montejo Ráez, Arturo \| Martín Valdivia, María Teresa
Keywords:	Spanish counter-narrative generation \| Hate speech \| Natural language generation \| Few-shot learning \| Generación de contranarrativas en español \| Discurso del odio \| Generación de lenguaje natural \| Aprendizaje con pocos ejemplos
Issue Date:	Sep-2023
Publisher:	Sociedad Española para el Procesamiento del Lenguaje Natural
Citation:	Procesamiento del Lenguaje Natural. 2023, 71: 227-245. https://doi.org/10.26342/2023-71-18
Abstract:	This paper analyzes the use of language models to automatically generate counter-narratives for hate speech in Spanish. Despite the existence of a few studies in English and other languages, no previous work has explored this topic focused on Spanish. The article shows that the use of GPT-3 outperforms other models in generating non-offensive and informative counter-narratives, which sometimes present compelling arguments. We have used few-shot learning algorithms applying different prompt strategies and analyzing the results for each of them. Additionally, a new corpus called CONAN-SP, which consists of 238 pairs of hate speech and counter-narratives in Spanish, has been made available to the research community to facilitate further investigations in this area. These findings highlight the potential of language models to combat hate speech in Spanish by counter-narrative generation.
Sponsor:	This work has been partially supported by Project CONSENSO (PID2021-122263OB-C21), Project MODERATES (TED2021-130145B-I00) and Project SocialTox (PDC2022-133146-C21) funded by MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR, Project PRECOM (SUBV-00016) funded by Ministerio de Consumo and WeLee project (1380939, FEDER Andalucía 2014-2020) funded by the Andalusian Regional Government.
URI:	http://hdl.handle.net/10045/137174
ISSN:	1135-5948
DOI:	10.26342/2023-71-18
Language:	eng
Type:	info:eu-repo/semantics/article
Rights:	© Sociedad Española para el Procesamiento del Lenguaje Natural. Distributed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License
Peer Review:	yes
Publisher version:	https://doi.org/10.26342/2023-71-18
Appears in Collections:	Procesamiento del Lenguaje Natural - Nº 71 (2023)

Files in This Item:
File	Description	Size	Format
PLN_71_18.pdf		1.25 MB	Adobe PDF

This item is licensed under a Creative Commons License