Monte-Carlo Tree Search

Implementations of the algorithms described in Munos, R. (2014). From bandits to Monte-Carlo Tree Search: The optimistic principle applied to optimization and planning. Foundations and Trends® in Machine Learning, 7(1), 1-129.

The algorithms are implemented only for finding the maximum of a function defined on [0, 1].

Algorithms implemented:

Section 3: Optimistic optimization with known smoothness
- Deterministic Optimistic Optimization (DOO)
- Stochastic Optimistic Optimization (StoOO)
- Hierarchical Optimistic Optimization (HOO)
Section 4: Simultaneous Optimistic Optimization
- Simultaneous Optimistic Optimization (SOO)
- Stochastic Simultaneous Optimistic Optimization (StoSOO)

Algorithms to implement:

Section 5: Optimistic planning
- Optimistic Planning algorithm (OPD)
- Open Loop Optimistic Planning (OLOP)
- Optimistic planning in MDP (OP-MDP)

Requirements:

Python 3.7
Numpy 1.14
Networkx 2.1

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
MCTS.ipynb		MCTS.ipynb
README.md		README.md
tree.py		tree.py
tree_with_known_smoothness.py		tree_with_known_smoothness.py
tree_with_unknown_smoothness.py		tree_with_unknown_smoothness.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Monte-Carlo Tree Search

About

Releases

Packages

Languages

yunjhongwu/MCTS

Folders and files

Latest commit

History

Repository files navigation

Monte-Carlo Tree Search

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages