Automatic counter-narrative generation for hate speech in Spanish

Vallecillo-Rodríguez, M. Estrella; Montejo Ráez, Arturo; Martín Valdivia, María Teresa

Automatic counter-narrative generation for hate speech in Spanish

Empreu sempre aquest identificador per citar o enllaçar aquest ítem http://hdl.handle.net/10045/137174

Información del item - Informació de l'item - Item information
Títol:	Automatic counter-narrative generation for hate speech in Spanish
Títol alternatiu:	Generación automática de contranarrativas para discursos de odio en español
Autors:	Vallecillo-Rodríguez, M. Estrella \| Montejo Ráez, Arturo \| Martín Valdivia, María Teresa
Paraules clau:	Spanish counter-narrative generation \| Hate speech \| Natural language generation \| Few-shot learning \| Generación de contranarrativas en español \| Discurso del odio \| Generación de lenguaje natural \| Aprendizaje con pocos ejemplos
Data de publicació:	de setembre-2023
Editor:	Sociedad Española para el Procesamiento del Lenguaje Natural
Citació bibliogràfica:	Procesamiento del Lenguaje Natural. 2023, 71: 227-245. https://doi.org/10.26342/2023-71-18
Resum:	This paper analyzes the use of language models to automatically generate counter-narratives for hate speech in Spanish. Despite the existence of a few studies in English and other languages, no previous work has explored this topic focused on Spanish. The article shows that the use of GPT-3 outperforms other models in generating non-offensive and informative counter-narratives, which sometimes present compelling arguments. We have used few-shot learning algorithms applying different prompt strategies and analyzing the results for each of them. Additionally, a new corpus called CONAN-SP, which consists of 238 pairs of hate speech and counter-narratives in Spanish, has been made available to the research community to facilitate further investigations in this area. These findings highlight the potential of language models to combat hate speech in Spanish by counter-narrative generation. \| Este trabajo analiza el uso de modelos lingüísticos para generar automáticamente contranarrativas al discurso del odio en español. A pesar de la existencia de algunos estudios en inglés y otros idiomas, ningún trabajo previo ha explorado este tema centrado en el español. El artículo muestra que el uso de GPT-3 supera a otros modelos en la generación de contranarrativas no ofensivas e informativas incluyendo en ocasiones argumentos convincentes. Hemos utilizado diferentes algoritmos de few-shot learning aplicando varias estrategias de prompting y analizando los resultados para cada una de ellas. Además, se ha puesto a disposición de la comunidad investigadora un nuevo corpus llamado CONAN-SP, que consta de 238 pares de discursos de odio y contranarrativas en español, para facilitar nuevas investigaciones en este ámbito. Estos resultados ponen de relieve el potencial de los modelos del lenguaje para combatir el discurso de odio en español mediante la generación de contranarrativas.
Patrocinadors:	This work has been partially supported by Project CONSENSO (PID2021-122263OB-C21), Project MODERATES (TED2021-130145B-I00) and Project SocialTox (PDC2022-133146-C21) funded by MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR, Project PRECOM (SUBV-00016) funded by Ministerio de Consumo and WeLee project (1380939, FEDER Andalucía 2014-2020) funded by the Andalusian Regional Government.
URI:	http://hdl.handle.net/10045/137174
ISSN:	1135-5948
DOI:	10.26342/2023-71-18
Idioma:	eng
Tipus:	info:eu-repo/semantics/article
Drets:	© Sociedad Española para el Procesamiento del Lenguaje Natural. Distribuido bajo Licencia Creative Commons Reconocimiento-NoComercial-SinObraDerivada 4.0
Revisió científica:	si
Versió de l'editor:	https://doi.org/10.26342/2023-71-18
Apareix a la col·lecció:	Procesamiento del Lenguaje Natural - Nº 71 (2023)

Arxius per aquest ítem:

Arxius per aquest ítem:
Arxiu	Descripció	Tamany	Format
PLN_71_18.pdf		1,25 MB	Adobe PDF	Obrir Vista prèvia Tancar vista prèvia

Veure citacions a Google Académic

Mostrar el registre complet de l'ítem

Aquest ítem està subjecte a una llicència de Creative Commons Llicència Creative Commons