A stochastic approach to median string computation

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/9689
Registro completo de metadatos
Registro completo de metadatos
Campo DCValorIdioma
dc.contributorReconocimiento de Formas e Inteligencia Artificialen
dc.contributor.authorOlivares Rodríguez, Cristian-
dc.contributor.authorOncina, Jose-
dc.contributor.otherUniversidad de Alicante. Departamento de Lenguajes y Sistemas Informáticosen
dc.date.accessioned2009-02-19T12:15:14Z-
dc.date.available2009-02-19T12:15:14Z-
dc.date.created2008-
dc.date.issued2008-
dc.identifier.citationOLIVARES RODRÍGUEZ, Cristian; ONCINA CARRATALÁ, Jose. "A stochastic approach to median string computation". En: Structural, Syntactic, and Statistical Pattern Recognition : joint IAPR International Workshop, SSPR & SPR 2008, Orlando, USA, December 4-6, 2008 : proceedings. Berlin : Springer, 2008. (Lecture Notes in Computer Science; 5342/2008). ISBN 978-3-540-89688-3, pp. 431-440en
dc.identifier.isbn978-3-540-89688-3-
dc.identifier.issn0302-9743 (Print)-
dc.identifier.issn1611-3349 (Online)-
dc.identifier.urihttp://hdl.handle.net/10045/9689-
dc.description.abstractDue to its robustness to outliers, many Pattern Recognition algorithms use the median as a representative of a set of points. A special case arises in Syntactical Pattern Recognition when the points (prototypes) are represented by strings. However, when the edit distance is used, finding the median becomes a NP-Hard problem. Then, either the search is restricted to strings in the data (set-median) or some heuristic approach is applied. In this work we use the (conditional) stochastic edit distance instead of the plain edit distance. It is not yet known if in this case the problem is also NP-Hard so an approximation algorithm is described. The algorithm is based on the extension of the string structure to multistrings (strings of stochastic vectors where each element represents the probability of each symbol) to allow the use of the Expectation Maximization technique. We carry out some experiments over a chromosomes corpus to check the efficiency of the algorithm.en
dc.description.sponsorshipSpanish CICyT, through project DPI2006-15542-C04-01. The IST Program of the European Community, under the PASCAL Network of Excellence, IST–2002-506778. The program CONSOLIDER INGENIO 2010 (CSD2007-00018).en
dc.languageengen
dc.publisherSpringer Berlin / Heidelbergen
dc.rightsThe original publication is available at www.springerlink.comen
dc.subjectMedian stringen
dc.subjectMultistringen
dc.subjectPattern recognitionen
dc.subjectStochastic edit distanceen
dc.subject.otherLenguajes y Sistemas Informáticosen
dc.titleA stochastic approach to median string computationen
dc.typeinfo:eu-repo/semantics/articleen
dc.peerreviewedsien
dc.identifier.doi10.1007/978-3-540-89689-0_47-
dc.relation.publisherversionhttp://dx.doi.org/10.1007/978-3-540-89689-0_47-
dc.rights.accessRightsinfo:eu-repo/semantics/restrictedAccess-
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/FP7/216886-
Aparece en las colecciones:INV - GRFIA - Artículos de Revistas
Investigaciones financiadas por la UE

Archivos en este ítem:
Archivos en este ítem:
Archivo Descripción TamañoFormato 
Thumbnailmedian.pdfVersión final (acceso restringido)415,96 kBAdobe PDFAbrir    Solicitar una copia


Todos los documentos en RUA están protegidos por derechos de autor. Algunos derechos reservados.