Soluciones de navegación inteligente para plataformas robóticas entrenadas en entornos virtuales

Gutiérrez Maestro, Eduardo

dc.contributor.advisor	López Sastre, Roberto Javier
dc.contributor.author	Gutiérrez Maestro, Eduardo
dc.date.accessioned	2019-11-08T17:24:41Z
dc.date.available	2019-11-08T17:24:41Z
dc.date.issued	2019
dc.identifier.uri	http://hdl.handle.net/10017/39971
dc.description.abstract	La navegación visual es la capacidad que tiene un agente autónomo de encontrar su camino en un entorno amplio y complejo basado únicamente en información visual. De hecho, es un problema fundamental en la visión por computador y la robótica. En este proyecto se propone un modelo basado en deep reinforcement learning que es capaz de navegar en una escena para alcanzar un objetivo visual, pero anticipando las posibles colisiones dentro del entorno. Técnicamente, se propone un modelo de tipo map-less, que sigue un método de reinforcement learning conocido como actor-critic, en donde la función de recompensa ha sido diseñada para evitar colisiones. Se expone una evaluación exhaustiva del modelo para el entorno virtual AI2-THOR, donde los resultados muestran que el modelo propuesto: 1) mejora el estado del arte en términos de número de pasos y de colisiones; 2) es capaz de converger más rápido que un modelo que no tiene en cuenta las colisiones, buscando únicamente el camino más corto; y 3) ofrece una interesante capacidad de generalización para alcanzar objetivos visuales que no han sido nunca vistos durante el entrenamiento.	es_ES
dc.description.abstract	Visual navigation is the ability of an autonomous agent to find its way in a large and complex environment based on visual information. It is indeed a fundamental problem in computer vision and robotics. In this project, it is proposed a deep reinforcement learning approach which is able to learn to navigate a scene to reach a given visual target, but anticipating the possible collisions with the environment. Technically, it is proposed a map-less-based model, which follows an actor-critic reinforcement learning method where the reward function has been designed to be collision aware. It is offered a thorough experimental evaluation of our solution in the AI2-THOR virtual environment, where the results show that the proposed method: 1) improves the state of the art in terms of number of steps and collisions; 2) is able to converge faster than a model which does not care about the collisions, simply searching for the shortest paths; and 3) offers an interesting generalization capability to reach visual targets that have never been seen during training.	en
dc.format.mimetype	application/pdf	en
dc.language.iso	spa	en
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 Internacional	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	en
dc.subject	Navegación visual	es_ES
dc.subject	Deep reinforcement learning	en
dc.subject	Robótica	es_ES
dc.subject	Visión por ordenador	es_ES
dc.subject	Robotics	en
dc.subject	Computer vision	en
dc.title	Soluciones de navegación inteligente para plataformas robóticas entrenadas en entornos virtuales	es_ES
dc.type	info:eu-repo/semantics/masterThesis	en
dc.subject.eciencia	Telecomunicaciones	es_ES
dc.subject.eciencia	Telecommunication	en
dc.contributor.affiliation	Universidad de Alcalá. Escuela Politécnica Superior	es_ES
dc.type.version	info:eu-repo/semantics/acceptedVersion	en
dc.description.degree	Máster Universitario en Ingeniería de Telecomunicación (M125)	es_ES
dc.rights.accessRights	info:eu-repo/semantics/openAccess	en

Files in this item

Name:: TFM_Gutiérrez_ Maestro_2019.pdf
Size:: 3.889Mb
Format:: PDF
Description:: Trabajo Fin de Máster (TFM)

View/Open

This item appears in the following Collection(s)

TFM - Máster Universitario en Ingeniería de Telecomunicación [40]
TFM - Máster Universitario en Ingeniería de Telecomunicación

Attribution-NonCommercial-NoDerivatives 4.0 Internacional

Este ítem está sujeto a una licencia Creative Commons.