search-and-rescue

Final Project for CS 238: Decision Making Under Uncertainty

Contributors: Avi Gupta, Sam Kwok, Ernesto Nam

Overview

In this project, we model the problem of search and rescue (SAR) in wintery conditions on mountainous terrain. If a skier or snowboarder gets stuck or lost on a mountain, an unmanned aerial vehicle (UAV) can be deployed to maximize the efficiency of SAR operations. Rather than being operated by humans, the efficiency of SAR can be improved by autonomous decision making.

MDP

The states in our MDP models are cells in a 2-D grid that represents a 3-D mountain. Each cell state contains a height and obstacle density. With this, we can produce reward and transitions. We leverage Q-Learning and Value Iteration methods to solve the MDP and obtain a policy.

Reward Model

The reward model is: R(sp | s, a) = height(sp) + density(sp) - fuel_cost(s, sp) + found(sp). The boolean "found" represents whether the stranded skier was at cell sp (s-prime). The fuel cost is -3 for ascending, -2 for lateral movement, and -1 for descending.

Transition Model

The transition model is: 0.5 if density(sp) >= 2 else 1. In other words, the UAV transitions with probability 0.5 if the tree density is high enough, representing the possibility of a crash.

Usage:

Generate a mountain model with python3 mountain.py <mountain_size>. Example output: data/3_mountain_data.csv Run q-learning with: python3 q.py <infile>.csv <outfile>.policy Example output: policies/3q.policy Run value iteration with: python3 value-iter.py <infile>.csv <outfile>.policy Example output: policies/3v.policy

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

search-and-rescue

Overview

MDP

Reward Model

Transition Model

Usage:

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
data		data
policies		policies
README.md		README.md
mountain.py		mountain.py
q.py		q.py
reward.py		reward.py
value-iter.py		value-iter.py

avigpt/search-and-rescue

Folders and files

Latest commit

History

Repository files navigation

search-and-rescue

Overview

MDP

Reward Model

Transition Model

Usage:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages