visual-semantic-navigation-codebase

A beginner-friendly codebase of classical models for visual-semantic navigation, such as ResNet18 and GCN.

Overview

This codebase provides a foundational framework for visual-semantic navigation; the corresponding paper is Visual Semantic Navigation using Scene Priors (https://arxiv.org/pdf/1810.06543). Semantic features are extracted with the GloVe model and visual features with a ResNet18 model; the two are concatenated and fed into an A3C reinforcement learning algorithm that generates actions, thereby achieving navigation. The model architecture is as follows:

[Figure: model architecture overview]
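To make the cascade concrete, here is a minimal PyTorch sketch of the architecture described above. Every class name and dimension below is an illustrative assumption (e.g. the 300-d GloVe vectors and the 6-action space), not the repository's actual API:

import torch
import torch.nn as nn
import torchvision.models as models

class NavModelSketch(nn.Module):
    # Illustrative only: a GloVe target embedding and ResNet18 visual features
    # are concatenated and fed to an A3C-style actor-critic head.
    def __init__(self, glove_dim=300, hidden_dim=512, num_actions=6):
        super().__init__()
        resnet = models.resnet18(weights=None)
        self.visual = nn.Sequential(*list(resnet.children())[:-1])  # -> (B, 512, 1, 1)
        self.fuse = nn.Linear(512 + glove_dim, hidden_dim)
        self.actor = nn.Linear(hidden_dim, num_actions)   # action logits
        self.critic = nn.Linear(hidden_dim, 1)            # state-value estimate

    def forward(self, image, glove_vec):
        v = self.visual(image).flatten(1)                 # (B, 512)
        x = torch.relu(self.fuse(torch.cat([v, glove_vec], dim=1)))
        return self.actor(x), self.critic(x)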

Setting up the Python Environment

  • Create a Python environment using conda:
conda create -n objNav python=3.9.12
conda activate objNav
  • Install PyTorch:
pip install torch==1.12.1+cu113 torchvision==0.13.1+cu113 torchaudio==0.12.1 --extra-index-url https://download.pytorch.org/whl/cu113
  • Clone the repository:
git clone https://github.com/FUIGUIMURONG/visual-semantic-navigation-codebase.git
  • Install the required dependencies from requirements.txt:
cd ~/visual-semantic-navigation-codebase  # adjust the path if you cloned to a different directory
pip install -r requirements.txt
  • Download the scene dataset:

Go to the following link and download the compressed file: Scene Dataset

Extract the archive into the visual-semantic-navigation-codebase/scene_data directory.
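As an optional sanity check, the following Python snippet (run from the repository root) verifies the PyTorch install and the dataset location; the scene_data path comes from the step above:

import os
import torch

print(torch.__version__)            # expect 1.12.1+cu113
print(torch.cuda.is_available())    # True if the CUDA build sees a GPU
print(os.path.isdir("scene_data"))  # True once the scene dataset is extracted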

Training

  • Activate the environment and navigate to the directory:
conda activate objNav
cd ~/visual-semantic-navigation-codebase  # adjust the path if you cloned to a different directory
  • Train:
python main.py --algorithm RL --train_or_test train

Optional arguments:

--tb_dir: directory in which to save the model, default is runs/

--gpu_ids: GPU IDs to use; default is 0, and -1 means CPU only

--max_RL_episode: maximum number of training episodes, default is 1000000

--n_record_RL: episode interval for logging training information, default is 100

--RL_save_episodes: episode interval for saving model checkpoints, default is 100000
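As an illustration, a training run that combines the flags above might look like this (the values are arbitrary examples):

python main.py --algorithm RL --train_or_test train --gpu_ids 0 --max_RL_episode 500000 --n_record_RL 100 --RL_save_episodes 50000 --tb_dir runs/exp1

If --tb_dir also holds TensorBoard event logs, as the flag name suggests (an assumption; check utils/flag_parser.py), training can be monitored with tensorboard --logdir runs/.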

  • Continue training from a previously trained model:
python main.py --algorithm RL --train_or_test train --If_IL_pretrain True --load_IL_path xxx

Replace xxx with the path of the model to be loaded.

Testing

  • Activate the environment and navigate to the directory:
conda activate objNav
cd ~/visual-semantic-navigation-codebase  # adjust the path if you cloned to a different directory
  • Test:
python main.py --algorithm RL --train_or_test test --test_RL_load_model xxx

Replace xxx with the path of the model to be loaded.

Optional arguments:

--test_setting: all, seen, or unseen; default is all
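For example, to evaluate only on unseen scenes:

python main.py --algorithm RL --train_or_test test --test_RL_load_model xxx --test_setting unseen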

You can view the result statistics in the JSON file written to the visual-semantic-navigation-codebase/res directory.
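A minimal sketch for inspecting those statistics, assuming only that res/ contains JSON files (the exact file names and schema are not documented here):

import glob
import json

for path in glob.glob("res/*.json"):
    with open(path) as f:
        stats = json.load(f)
    # Print the top-level keys (or the raw value) to discover the schema.
    print(path, list(stats.keys()) if isinstance(stats, dict) else stats)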

Others

  • More parameters:

For more parameters, refer to visual-semantic-navigation-codebase/utils/flag_parser.py.
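If the parser follows standard argparse conventions (an assumption), you can also list every flag with its default by running:

python main.py --help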

  • Try pretrained models:

Go to the following link and download the compressed file: Pretrained Models

Check whether the visual-semantic-navigation-codebase/runs directory exists. If it does not, create it:

mkdir ~/visual-semantic-navigation-codebase/runs  # adjust the path if you cloned to a different directory

Then extract the downloaded folder into this directory.

The files with the .pth extension in the folder are the model checkpoints. Use the same command as in the Testing section, replacing xxx with the path to a .pth file. For example:

python main.py --algorithm RL --train_or_test test --test_RL_load_model ~/visual-semantic-navigation-codebase/runs/RL_2023_11_5_16_9_BaseModel/epoch_600000.pth

This loads the checkpoint saved after 600,000 training episodes.
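If you want to inspect a checkpoint before testing, torch.load opens any .pth file; the layout of the saved object is an assumption here, so print it to find out:

import torch

ckpt = torch.load("runs/RL_2023_11_5_16_9_BaseModel/epoch_600000.pth", map_location="cpu")
# A .pth file is typically a state_dict (a dict of tensors) or a wrapper
# dict containing one; printing the keys shows which layout this repo uses.
print(list(ckpt.keys()) if isinstance(ckpt, dict) else type(ckpt))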
