GitHub - clay-curry/flapPy-RL: An RL algorithm that solves Flappy Bird. With the return R defined as the number of obstacles cleared by the time the bird crashes, the optimal action-value function q* : S × A → ℝ gives the expected return E(R) from the state-action pair (s, a). Experiments support the conjecture that a tabular n-step Sarsa algorithm converges to a policy π that clears arbitrarily many obstacles (confirmed up to 1,000,000).
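
To make the description concrete, here is a minimal sketch of tabular n-step Sarsa in the textbook form, run on a toy grid game. It is not the code in n_sarsa.py or flappy.py: the ToyFlappy environment, its constants, the reward of +1 per cleared pipe, and all hyperparameters are assumptions chosen for illustration only. With gamma = 1 the return of an episode equals the number of pipes cleared, which mirrors the definition of R above.

```python
import random
from collections import defaultdict

ACTIONS = (0, 1)      # 0 = fall one cell, 1 = flap up one cell
MAX_HEIGHT = 4        # heights are 0..MAX_HEIGHT (assumed toy grid)
PIPE_SPACING = 3      # steps between consecutive pipes (assumed)
MAX_PIPES = 20        # episode cap so every episode terminates (assumed)


class ToyFlappy:
    """Hypothetical stand-in for the game; not the repository's flappy.py."""

    def __init__(self, rng):
        self.rng = rng

    def _observe(self):
        return (self.height, self.dist, self.gap)

    def reset(self):
        self.height, self.dist, self.cleared = 2, PIPE_SPACING, 0
        self.gap = self.rng.randint(0, MAX_HEIGHT - 1)  # gap covers gap, gap+1
        return self._observe()

    def step(self, action):
        delta = 1 if action == 1 else -1
        self.height = min(MAX_HEIGHT, max(0, self.height + delta))
        self.dist -= 1
        reward, done = 0.0, False
        if self.dist == 0:                      # the bird has reached a pipe
            if self.height in (self.gap, self.gap + 1):
                reward = 1.0                    # one more obstacle cleared
                self.cleared += 1
                self.dist = PIPE_SPACING
                self.gap = self.rng.randint(0, MAX_HEIGHT - 1)
                done = self.cleared >= MAX_PIPES
            else:
                done = True                     # crash ends the episode
        return self._observe(), reward, done


def epsilon_greedy(q, state, epsilon, rng):
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def n_step_sarsa(env, episodes, n=4, alpha=0.1, gamma=1.0, epsilon=0.1, rng=None):
    """Tabular n-step Sarsa; returns the learned table q[(state, action)]."""
    rng = rng or random.Random(0)
    q = defaultdict(float)
    for _ in range(episodes):
        states = [env.reset()]
        actions = [epsilon_greedy(q, states[0], epsilon, rng)]
        rewards = [0.0]          # rewards[i] holds R_i; index 0 is padding
        T, t = float("inf"), 0
        while True:
            if t < T:
                state, reward, done = env.step(actions[t])
                states.append(state)
                rewards.append(reward)
                if done:
                    T = t + 1
                else:
                    actions.append(epsilon_greedy(q, state, epsilon, rng))
            tau = t - n + 1      # time step whose estimate is updated now
            if tau >= 0:
                last = min(tau + n, T)
                G = sum(gamma ** (i - tau - 1) * rewards[i]
                        for i in range(tau + 1, last + 1))
                if tau + n < T:  # bootstrap from the value n steps ahead
                    G += gamma ** n * q[(states[tau + n], actions[tau + n])]
                key = (states[tau], actions[tau])
                q[key] += alpha * (G - q[key])
            if tau == T - 1:
                break
            t += 1
    return q


def greedy_score(env, q):
    """Play one episode greedily and return the number of pipes cleared."""
    state, done = env.reset(), False
    while not done:
        action = max(ACTIONS, key=lambda a: q[(state, a)])
        state, _, done = env.step(action)
    return env.cleared


if __name__ == "__main__":
    rng = random.Random(0)
    env = ToyFlappy(rng)
    q = n_step_sarsa(env, episodes=2000, rng=rng)
    print("pipes cleared by the greedy policy:", greedy_score(env, q))
```

The table is a defaultdict keyed by (state, action), so unseen pairs start at 0 without enumerating the state space up front. The episode cap MAX_PIPES exists only to keep the toy episodes finite; it says nothing about how the repository handles long runs.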

Repository contents (100 commits):
assets
data
.gitignore
config.py
flappy.py
n_sarsa.py
q_agent.py
q_agent_flappy.py