Reinforcement Learning in Minesweeper

Abstract

Minesweeper is a well-known single-player puzzle game in which the player must clear a board containing hidden mines, guided by numerical clues that indicate how many mines lie in each cell's neighbourhood. We implemented the Q-learning and deep Q-learning algorithms, ran several experiments on reward structures, tuned hyperparameters, and trained two final agents, which achieved different levels of success. Both agents performed substantially better than a baseline random agent. Given the limited training time, the Q-learning agent achieved a higher average reward and board completion rate, while the deep Q-learning agent achieved a higher win rate. The latter is likely to perform better if trained for longer, and is ultimately better suited to larger boards with continuous state spaces because it can predict the best actions for unseen states.
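
For readers unfamiliar with the tabular method named above, the following is a minimal sketch of a generic Q-learning update with epsilon-greedy action selection. It is not taken from the project's notebooks: the function names, the state encoding (any hashable value, such as a tuple of visible cell values), and the default `alpha` and `gamma` values are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def make_q_table():
    # Hypothetical helper: maps (state, action) pairs to estimated returns.
    # Pairs that have never been visited default to 0.0.
    return defaultdict(float)


def choose_action(q, state, actions, epsilon, rng=random):
    # Epsilon-greedy selection: explore with probability epsilon,
    # otherwise pick the action with the highest current estimate.
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, next_actions,
             alpha=0.1, gamma=0.9):
    # Standard Q-learning target: reward plus the discounted value of the
    # best action available in the next state. An empty next_actions list
    # stands for a terminal state (board won or mine hit), valued at 0.0.
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]
```

Because the table only stores states it has actually visited, it cannot generalise to unseen boards, which is the limitation the deep Q-learning agent is meant to address.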

Acknowledgments

Built in collaboration with J. Dudhat, N. Hird, S. Kosman, S. Theriault, and Y. Zhao
Designed under the instruction of N. Mehta

report
.gitignore
LICENSE.md
README.md
deep-q-learning.ipynb
q-learning.ipynb

zakwht/minesweeper
