Model-Based-RL

AlphaZero

This is the final trained Network in play against a Human Opponent.

Play

To play against the Algorithm, clone this repository on your local machine, make sure you have python 3 or higher as well as PyTorch installed.

Then Run the Python Script Play.py in the AlphaZero Folder

Overview

AlphaZero uses a Neural Network as a value approximator, this Network works in Conjunction with Monte Carlo Tree Search (MCTS)

Network Architecture

The Exact architecture can be found in Nets.py

The Graph Below shows how the Networks Perform Compared to their previous networks, during training:

This graph implies that, except for network number 5 (compared to network number 4), there was an improvement every network.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
AlphaZero		AlphaZero
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Model-Based-RL

AlphaZero

Play

Overview

Network Architecture

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

IvLabs/model-based-RL

Folders and files

Latest commit

History

Repository files navigation

Model-Based-RL

AlphaZero

Play

Overview

Network Architecture

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages