The aim of this work is to develop an approach that enables Unmanned Aerial Systems (UAS) to efficiently learn to navigate in large-scale urban environments and transfer their acquired expertise to novel environments. To achieve this, we propose a meta curriculum training scheme. First, meta-training allows the agent to learn a master policy that generalizes across tasks. The resulting model is then fine-tuned on the downstream tasks. We organize the training curriculum hierarchically, so that the agent is guided from coarse to fine towards the target task. In addition, we introduce Incremental Self-Adaptive Reinforcement learning (ISAR), an algorithm that combines the ideas of incremental learning and meta reinforcement learning (MRL). In contrast to traditional reinforcement learning (RL), which focuses on acquiring a policy for a specific task, MRL aims to learn a policy that transfers quickly to novel tasks. However, the training process of MRL is time-consuming, whereas our proposed ISAR algorithm achieves faster convergence than conventional MRL algorithms. We evaluate the proposed methodologies in simulated environments and demonstrate that this training philosophy, in conjunction with the ISAR algorithm, significantly improves both the convergence speed for navigation in large-scale cities and the adaptation proficiency in novel environments.
Two-stage learning framework: The navigation task consists of two phases: meta-training and curriculum fine-tuning. Meta-training allows the agent to learn a master navigation policy. The hierarchically structured curriculum then adapts the meta-policy to the target task. This meta-policy can further be transferred to novel environments.
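The two phases above can be sketched as a minimal, self-contained loop: a master policy is trained across a distribution of tasks, then adapted to a single downstream task. All names (`Policy`, `task_gradient`, the toy 1-D loss) are illustrative stand-ins, not the paper's implementation.

```python
# Hypothetical sketch of the two-stage framework: meta-training across a
# set of navigation tasks, then fine-tuning on the target task.
import random

class Policy:
    """Toy 1-D policy parameter, standing in for the policy network."""
    def __init__(self, weight=0.0):
        self.weight = weight

    def update(self, gradient, lr):
        self.weight -= lr * gradient

def task_gradient(policy, task):
    # Placeholder loss: squared distance of the weight to the task optimum.
    return 2.0 * (policy.weight - task)

def meta_train(tasks, meta_lr=0.1, steps=100):
    """Phase 1: learn a master policy that sits close to all task optima."""
    policy = Policy()
    for _ in range(steps):
        task = random.choice(tasks)
        policy.update(task_gradient(policy, task), meta_lr)
    return policy

def fine_tune(policy, target_task, lr=0.1, steps=50):
    """Phase 2: adapt the meta-policy to the downstream target task."""
    for _ in range(steps):
        policy.update(task_gradient(policy, target_task), lr)
    return policy

meta_policy = meta_train(tasks=[-1.0, 0.0, 1.0, 2.0])
adapted = fine_tune(meta_policy, target_task=1.5)
```

Because the meta-policy already lies near the task optima, the fine-tuning phase starts from a good initialization, which is the intuition behind the fast transfer described above.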
We first set the agent’s altitude to 300 meters for meta-training. Then, we fine-tune the meta-policy with a hierarchically structured training curriculum. We use ResNet-18 to extract the features of the current state and the target state. The combined feature is then fed into the policy network to generate the navigation policy and value. Our training strategy outperforms a standard agent trained from scratch at 15 meters.
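The coarse-to-fine curriculum can be made concrete as a decreasing altitude schedule from the 300 m meta-training altitude down to the 15 m target. The halving factor and number of stages here are our assumption for illustration; the paper's exact schedule may differ.

```python
# Assumed coarse-to-fine altitude schedule: start high (coarse view of the
# city), halve the altitude each stage, and finish at the target altitude.
def altitude_curriculum(start_m=300.0, target_m=15.0, factor=0.5):
    """Return a decreasing sequence of training altitudes in meters."""
    altitude = start_m
    stages = []
    while altitude > target_m:
        stages.append(altitude)
        altitude *= factor
    stages.append(target_m)  # always end exactly at the target altitude
    return stages

print(altitude_curriculum())  # [300.0, 150.0, 75.0, 37.5, 18.75, 15.0]
```

Each stage fine-tunes the policy from the previous one, so the agent never faces the hardest (lowest-altitude) task cold.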
Illustration of the ISAR framework with an adaptive-update step size of 2. During exploration, we compute two types of losses: the interaction loss, and the adaptive loss over each trajectory segment, which is used to update the adaptation policy. The results show that our ISAR algorithm achieves a significant improvement in convergence speed over SAVN (traditional MRL).
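The segment-wise adaptive update can be sketched as follows: per-step interaction losses are accumulated, and after every segment of two steps an adaptive loss over that segment updates the adaptation parameter. The placeholder losses (negative reward, segment mean) and the parameter name `theta` are our assumptions; ISAR's actual loss definitions are in the paper.

```python
# Minimal sketch of an ISAR-style inner loop with adaptive-update
# step size = 2: adapt after every two interaction steps.
def isar_episode(env_rewards, theta=0.0, adapt_lr=0.1, step_size=2):
    """Run one episode, adapting theta every `step_size` steps."""
    segment = []
    for t, reward in enumerate(env_rewards, start=1):
        # Interaction loss on a single step (placeholder: negative reward).
        segment.append(-reward)
        if t % step_size == 0:
            # Adaptive loss over the trajectory segment (placeholder: mean).
            adaptive_loss = sum(segment) / len(segment)
            theta -= adapt_lr * adaptive_loss  # self-adaptive update
            segment = []
    return theta

theta = isar_episode([1.0, 0.5, -0.2, 0.8])
```

Updating mid-episode, rather than only after the full trajectory, is what makes the adaptation incremental and is the intuition behind the faster convergence claimed over SAVN.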
The transfer-learning process to unseen environments. We first conduct meta-training in scene A, then fine-tune the meta-policy to navigate in scene B. The meta-trained agent transfers rapidly to unseen environments, but has limitations when transferring from urban to wild environments.
Training results for navigation in AirSim urban environments, shown as six example navigation episodes. The UAS starts from random locations and reaches the targets, which are marked by a white car.
We first conduct meta-training in scene 1 with 25 meta-tasks. Then, we transfer the meta-policy to scene 2 through fine-tuning. This video shows the fine-tuned policy navigating in scene 2.
A real-world application of our end-to-end visual navigation algorithm in an indoor environment. (Supplementary work, not included in this paper.)