Skip to content

Commit

Permalink
moving images
Browse files Browse the repository at this point in the history
  • Loading branch information
jlooper committed Jun 29, 2021
1 parent 0fb02c2 commit 60b1217
Show file tree
Hide file tree
Showing 3 changed files with 6 additions and 6 deletions.
4 changes: 0 additions & 4 deletions 8-Reinforcement/1-QLearning/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,10 +25,6 @@ You can open [the lesson notebook](notebook.ipynb) and walk through this lesson

In this lesson, we will explore the world of **[Peter and the Wolf](https://en.wikipedia.org/wiki/Peter_and_the_Wolf)**, inspired by a musical fairy tale by a Russian composer, [Sergei Prokofiev](https://en.wikipedia.org/wiki/Sergei_Prokofiev). We will use **Reinforcement Learning** to let Peter explore his environment, collect tasty apples and avoid meeting the wolf.

![peter and the wolf](images/peter.png)

> Peter and his friends need to escape the hungry wolf! Image by [Jen Looper](https://twitter.com/jenlooper)
**Reinforcement Learning** (RL) is a learning technique that allows us to learn an optimal behavior of an **agent** in some **environment** by running many experiments. An agent in this environment should have some **goal**, defined by a **reward function**.

## The environment
Expand Down
8 changes: 6 additions & 2 deletions 8-Reinforcement/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,9 +4,9 @@ Reinforcement learning, RL, is seen as one of the basic machine learning paradig

Imagine you have a simulated environment, like the stock market for example. What happens if you impose this or that regulation does it have a positive or negative effect? The whole point is being able to change course if something negative happen, so called _negative reinforcement_ or if it's a positive outcome, to keep building on that, so called _positive reinforcement_.

[![Peter and the Wolf](https://img.youtube.com/vi/Fmi5zHg4QSM/0.jpg)](https://www.youtube.com/watch?v=Fmi5zHg4QSM)
![peter and the wolf](images/peter.png)

> 🎥 Click the image above to listen to Peter and the Wolf by Prokofiev
> Peter and his friends need to escape the hungry wolf! Image by [Jen Looper](https://twitter.com/jenlooper)
## Regional topic: Peter and the Wolf (Russia)

Expand All @@ -15,6 +15,10 @@ Imagine you have a simulated environment, like the stock market for example. Wha
- **Explore** the surrounding area and build an optimal navigation map
- **Learn** how to use a skateboard and balance on it, in order to move around faster.

[![Peter and the Wolf](https://img.youtube.com/vi/Fmi5zHg4QSM/0.jpg)](https://www.youtube.com/watch?v=Fmi5zHg4QSM)

> 🎥 Click the image above to listen to Peter and the Wolf by Prokofiev
## Reinforcement learning

In previous sections, you have seen two examples of machine learning problems:
Expand Down
File renamed without changes

0 comments on commit 60b1217

Please sign in to comment.