MirBot: A collaborative object recognition system for smartphones using convolutional neural networks
Empreu sempre aquest identificador per citar o enllaçar aquest ítem
http://hdl.handle.net/10045/74809
Títol: | MirBot: A collaborative object recognition system for smartphones using convolutional neural networks |
---|---|
Autors: | Pertusa, Antonio | Gallego, Antonio-Javier | Bernabeu, Marisa |
Grups d'investigació o GITE: | Reconocimiento de Formas e Inteligencia Artificial |
Centre, Departament o Servei: | Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos |
Paraules clau: | Object recognition | Image datasets | Convolutional neural networks | Transfer learning | Multimodality | Human computer interaction |
Àrees de coneixement: | Lenguajes y Sistemas Informáticos |
Data de publicació: | 7-de juny-2018 |
Editor: | Elsevier |
Citació bibliogràfica: | Neurocomputing. 2018, 293: 87-99. doi:10.1016/j.neucom.2018.03.005 |
Resum: | MirBot is a collaborative application for smartphones that allows users to perform object recognition. This app can be used to take a photograph of an object, select the region of interest and obtain the most likely class (dog, chair, etc.) by means of similarity search using features extracted from a convolutional neural network (CNN). The answers provided by the system can be validated by the user so as to improve the results for future queries. All the images are stored together with a series of metadata, thus enabling a multimodal incremental dataset labeled with synset identifiers from the WordNet ontology. This dataset grows continuously thanks to the users’ feedback, and is publicly available for research. This work details the MirBot object recognition system, analyzes the statistics gathered after more than four years of usage, describes the image classification methodology, and performs an exhaustive evaluation using handcrafted features, neural codes, different transfer learning techniques, PCA compression and metadata, which can be used to improve the image classifier results. The app is freely available at the Apple and Google Play stores. |
Patrocinadors: | This work was supported by the TIMUL project (TIN2013- 48152-C2-1-R) and the University Institute for Computing Research (IUII) from the University of Alicante. |
URI: | http://hdl.handle.net/10045/74809 |
ISSN: | 0925-2312 (Print) | 1872-8286 (Online) |
DOI: | 10.1016/j.neucom.2018.03.005 |
Idioma: | eng |
Tipus: | info:eu-repo/semantics/article |
Drets: | © 2018 Elsevier B.V. |
Revisió científica: | si |
Versió de l'editor: | https://doi.org/10.1016/j.neucom.2018.03.005 |
Apareix a la col·lecció: | INV - GRFIA - Artículos de Revistas |
Arxius per aquest ítem:
Arxiu | Descripció | Tamany | Format | |
---|---|---|---|---|
2018_Pertusa_etal_Neurocomputing_final.pdf | Versión final (acceso restringido) | 1,97 MB | Adobe PDF | Obrir Sol·licitar una còpia |
2018_Pertusa_etal_Neurocomputing_preprint.pdf | Preprint (acceso abierto) | 5,64 MB | Adobe PDF | Obrir Vista prèvia |
Tots els documents dipositats a RUA estan protegits per drets d'autors. Alguns drets reservats.