A contextual normalised edit distance
Por favor, use este identificador para citar o enlazar este ítem:
http://hdl.handle.net/10045/8774
Título: | A contextual normalised edit distance |
---|---|
Autor/es: | Higuera, Colin de la | Micó, Luisa |
Grupo/s de investigación o GITE: | Reconocimiento de Formas e Inteligencia Artificial |
Centro, Departamento o Servicio: | Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos | Laboratorie Hubert Curien |
Palabras clave: | Edit distance | Metric | Normalization |
Área/s de conocimiento: | Lenguajes y Sistemas Informáticos | Ciencia de la Computación e Inteligencia Artificial |
Fecha de publicación: | 7-abr-2008 |
Editor: | IEEE |
Cita bibliográfica: | HIGUERA, Colin de la; MICÓ ANDRÉS, Luisa. "A contextual normalised edit distance". En: Data Engineering Workshop, 2008 : ICDEW 2008, IEEE 24th International Conference on. Piscataway, NJ : IEEE, 2008. ISBN 978-1-4244-2161-9, pp. 354-361 |
Resumen: | In order to better fit a variety of pattern recognition problems over strings, using a normalised version of the edit or Levenshtein distance is considered to be an appropriate approach. The goal of normalisation is to take into account the lengths of the strings. We define a new normalisation, contextual, where each edit operation is divided by the length of the string on which the edit operation takes place. We prove that this contextual edit distance is a metric and that it can be computed through an extension of the usual dynamic programming algorithm for the edit distance. We also provide a fast heuristic which nearly always returns the same result and we show over several experiments that the distance obtains good results in classification tasks and has a low intrinsic dimension in comparison with other normalised edit distances. |
Patrocinador/es: | Spanish CICyT for partial support of this work through projects DPI2006-15542-C04-01, the IST Programme of the European Community, under the PASCAL Network of Excellence, IST–2002-506778, the program CONSOLIDER INGENIO 2010 (CSD2007-00018), and the ANR (for program BLAN07-1_184534). |
URI: | http://hdl.handle.net/10045/8774 |
ISBN: | 978-1-4244-2161-9 |
DOI: | 10.1109/ICDEW.2008.4498345 |
Idioma: | eng |
Tipo: | info:eu-repo/semantics/bookPart |
Revisión científica: | si |
Aparece en las colecciones: | INV - GRFIA - Capítulos de Libros |
Archivos en este ítem:
Archivo | Descripción | Tamaño | Formato | |
---|---|---|---|---|
2-_sisap08-2.pdf | 115,76 kB | Adobe PDF | Abrir Vista previa | |
Todos los documentos en RUA están protegidos por derechos de autor. Algunos derechos reservados.