A study of the effect of noise and occlusion on the accuracy of convolutional neural networks applied to 3D object recognition

Garcia-Garcia, Alberto; Garcia-Rodriguez, Jose; Orts-Escolano, Sergio; Oprea, Sergiu; Gomez-Donoso, Francisco; Cazorla, Miguel

A study of the effect of noise and occlusion on the accuracy of convolutional neural networks applied to 3D object recognition

Empreu sempre aquest identificador per citar o enllaçar aquest ítem http://hdl.handle.net/10045/72633

Información del item - Informació de l'item - Item information
Títol:	A study of the effect of noise and occlusion on the accuracy of convolutional neural networks applied to 3D object recognition
Autors:	Garcia-Garcia, Alberto \| Garcia-Rodriguez, Jose \| Orts-Escolano, Sergio \| Oprea, Sergiu \| Gomez-Donoso, Francisco \| Cazorla, Miguel
Grups d'investigació o GITE:	Informática Industrial y Redes de Computadores \| Robótica y Visión Tridimensional (RoViT)
Centre, Departament o Servei:	Universidad de Alicante. Departamento de Tecnología Informática y Computación \| Universidad de Alicante. Departamento de Ciencia de la Computación e Inteligencia Artificial \| Universidad de Alicante. Instituto Universitario de Investigación Informática
Paraules clau:	Deep learning \| 3D object recognition \| Convolutional neural networks \| Noise \| Occlusion \| Caffe
Àrees de coneixement:	Arquitectura y Tecnología de Computadores \| Ciencia de la Computación e Inteligencia Artificial
Data de publicació:	de novembre-2017
Editor:	Elsevier
Citació bibliogràfica:	Computer Vision and Image Understanding. 2017, 164: 124-134. doi:10.1016/j.cviu.2017.06.006
Resum:	In this work, we carry out a study of the effect of adverse conditions, which characterize real-world scenes, on the accuracy of a Convolutional Neural Network applied to 3D object class recognition. Firstly, we discuss possible ways of representing 3D data to feed the network. In addition, we propose a set of representations to be tested. Those representations consist of a grid-like structure (fixed and adaptive) and a measure for the occupancy of each cell of the grid (binary and normalized point density). After that, we propose and implement a Convolutional Neural Network for 3D object recognition using Caffe. At last, we carry out an in-depth study of the performance of the network over a 3D CAD model dataset, the Princeton ModelNet project, synthetically simulating occlusions and noise models featured by common RGB-D sensors. The results show that the volumetric representations for 3D data play a key role on the recognition process and Convolutional Neural Network can be considerably robust to noise and occlusions if a proper representation is chosen.
Patrocinadors:	This work has been supported by the Spanish Government DPI2013-40534-R grant for the SIRMAVED project, also supported with FEDER funds. This work has also been funded by the grant “Ayudas para Estudios de Máster e Iniciación a la Investigación” from the University of Alicante.
URI:	http://hdl.handle.net/10045/72633
ISSN:	1077-3142 (Print) \| 1090-235X (Online)
DOI:	10.1016/j.cviu.2017.06.006
Idioma:	eng
Tipus:	info:eu-repo/semantics/article
Drets:	© 2017 Elsevier Inc.
Revisió científica:	si
Versió de l'editor:	http://dx.doi.org/10.1016/j.cviu.2017.06.006
Apareix a la col·lecció:	INV - I2RC - Artículos de Revistas INV - RoViT - Artículos de Revistas INV - AIA - Artículos de Revistas

Arxius per aquest ítem:

Arxius per aquest ítem:
Arxiu	Descripció	Tamany	Format
2017_Garcia-Garcia_etal_CompVisImageUnderst_final.pdf	Versión final (acceso restringido)	2,74 MB	Adobe PDF	Obrir Sol·licitar una còpia
2017_Garcia-Garcia_etal_CompVisImageUnderst_preprint.pdf	Preprint (acceso abierto)	3,62 MB	Adobe PDF	Obrir Vista prèvia Tancar vista prèvia

Veure citacions a Google Académic

Mostrar el registre complet de l'ítem

Tots els documents dipositats a RUA estan protegits per drets d'autors. Alguns drets reservats.