Learning to Navigate in Large and Unseen Environments

Environment: We use the AirSim Urban simulated environment to deploy our algorithm.
source: https://www.youtube.com/watch?v=_UgeDCNadHQ

Abstract

The aim of this work is to develop an approach that enables Unmanned Aerial System (UAS) to efficiently learn to navigate in large-scale urban environments and transfer their acquired expertise to novel environments. To achieve this, we propose a meta curriculum training scheme. First, the meta training allows the agent to learn a master policy to generalize across tasks. The resulting model is then fine-tuned on the downstream tasks. We organize the training curriculum in a hierarchical manner, such that the agent is guided from coarse to fine towards the target task. In addition, we introduce Incremental Self-Adaptive Reinforcement learning (ISAR), an algorithm that combines the ideas of incremental learning and meta reinforcement learning (MRL). In contrast to traditional reinforcement learning (RL), which focuses on acquiring a policy for a specific task, MRL aims to learn a policy with fast transfer ability to novel tasks. However, the training process of MRL is time-consuming, whereas our proposed ISAR algorithm achieves faster convergence than conventional MRL algorithm. We evaluate the proposed methodologies in simulated environments and demonstrate that using such training philosophy in conjunction with the ISAR algorithm significantly improves the convergence speed for navigation in large-scale cities and the adaptation proficiency in novel environments.

UAS Urban Navigation

The training result of navigating in AirSim urban environments is shown in 6 example navigation episodes. The UAS starts from random locations and reaches the targets, which are marked by a white car.






Adapt Meta-policy to Unseen Environment

We first conduct meta-training in scene 1 with 25 meta-tasks. Then, we transfer the meta-policy for navigation in scene 2 through fine-tuning. This video shows the fine-tuning results navigating in scene 2.






Robot Indoor Visual Navigation

The real application of our end-to-end visual navigation algorithm in indoor environment. (Supplementary work, not included in this paper.)


BibTeX

BibTex Code Here