Automatic counter-narrative generation for hate speech in Spanish

Vallecillo-Rodríguez, M. Estrella; Montejo Ráez, Arturo; Martín Valdivia, María Teresa

Automatic counter-narrative generation for hate speech in Spanish

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/137174

Registro completo de metadatos

Registro completo de metadatos
Campo DC	Valor	Idioma
dc.contributor.author	Vallecillo-Rodríguez, M. Estrella	-
dc.contributor.author	Montejo Ráez, Arturo	-
dc.contributor.author	Martín Valdivia, María Teresa	-
dc.date.accessioned	2023-09-14T10:20:36Z	-
dc.date.available	2023-09-14T10:20:36Z	-
dc.date.issued	2023-09	-
dc.identifier.citation	Procesamiento del Lenguaje Natural. 2023, 71: 227-245. https://doi.org/10.26342/2023-71-18	es_ES
dc.identifier.issn	1135-5948	-
dc.identifier.uri	http://hdl.handle.net/10045/137174	-
dc.description.abstract	This paper analyzes the use of language models to automatically generate counter-narratives for hate speech in Spanish. Despite the existence of a few studies in English and other languages, no previous work has explored this topic focused on Spanish. The article shows that the use of GPT-3 outperforms other models in generating non-offensive and informative counter-narratives, which sometimes present compelling arguments. We have used few-shot learning algorithms applying different prompt strategies and analyzing the results for each of them. Additionally, a new corpus called CONAN-SP, which consists of 238 pairs of hate speech and counter-narratives in Spanish, has been made available to the research community to facilitate further investigations in this area. These findings highlight the potential of language models to combat hate speech in Spanish by counter-narrative generation.	es_ES
dc.description.abstract	Este trabajo analiza el uso de modelos lingüísticos para generar automáticamente contranarrativas al discurso del odio en español. A pesar de la existencia de algunos estudios en inglés y otros idiomas, ningún trabajo previo ha explorado este tema centrado en el español. El artículo muestra que el uso de GPT-3 supera a otros modelos en la generación de contranarrativas no ofensivas e informativas incluyendo en ocasiones argumentos convincentes. Hemos utilizado diferentes algoritmos de few-shot learning aplicando varias estrategias de prompting y analizando los resultados para cada una de ellas. Además, se ha puesto a disposición de la comunidad investigadora un nuevo corpus llamado CONAN-SP, que consta de 238 pares de discursos de odio y contranarrativas en español, para facilitar nuevas investigaciones en este ámbito. Estos resultados ponen de relieve el potencial de los modelos del lenguaje para combatir el discurso de odio en español mediante la generación de contranarrativas.	es_ES
dc.description.sponsorship	This work has been partially supported by Project CONSENSO (PID2021-122263OB-C21), Project MODERATES (TED2021-130145B-I00) and Project SocialTox (PDC2022-133146-C21) funded by MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR, Project PRECOM (SUBV-00016) funded by Ministerio de Consumo and WeLee project (1380939, FEDER Andalucía 2014-2020) funded by the Andalusian Regional Government.	es_ES
dc.language	eng	es_ES
dc.publisher	Sociedad Española para el Procesamiento del Lenguaje Natural	es_ES
dc.rights	© Sociedad Española para el Procesamiento del Lenguaje Natural. Distribuido bajo Licencia Creative Commons Reconocimiento-NoComercial-SinObraDerivada 4.0	es_ES
dc.subject	Spanish counter-narrative generation	es_ES
dc.subject	Hate speech	es_ES
dc.subject	Natural language generation	es_ES
dc.subject	Few-shot learning	es_ES
dc.subject	Generación de contranarrativas en español	es_ES
dc.subject	Discurso del odio	es_ES
dc.subject	Generación de lenguaje natural	es_ES
dc.subject	Aprendizaje con pocos ejemplos	es_ES
dc.title	Automatic counter-narrative generation for hate speech in Spanish	es_ES
dc.title.alternative	Generación automática de contranarrativas para discursos de odio en español	es_ES
dc.type	info:eu-repo/semantics/article	es_ES
dc.peerreviewed	si	es_ES
dc.identifier.doi	10.26342/2023-71-18	-
dc.relation.publisherversion	https://doi.org/10.26342/2023-71-18	es_ES
dc.rights.accessRights	info:eu-repo/semantics/openAccess	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2021-122263OB-C21	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/TED2021-130145B-I00	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PDC2022-133146-C21	es_ES
Aparece en las colecciones:	Procesamiento del Lenguaje Natural - Nº 71 (2023)

Archivos en este ítem:

Archivos en este ítem:
Archivo	Descripción	Tamaño	Formato
PLN_71_18.pdf		1,25 MB	Adobe PDF	Abrir Vista previa Cerrar vista previa

Ver citas en Google Académico

Muestra el registro sencillo

Este ítem está licenciado bajo Licencia Creative Commons