Recent Posts

Welcome to my blog.

Kelsey Is Cute

17 September 2019


Figure 1: Kelsey being cute.

Temporal Difference Learning For Ultimate Tic Tac Toe

04 June 2019

This post describes the implementation of temporal difference learning that can be found on my GitHub. This remarkably simple algorithm learns entirely through self-play, with no human knowledge beyond the rules of the game. As an example, we will train the algorithm to play ultimate tic-tac-toe, but the same algorithm can be applied to almost any other game with varying degrees of success. This post assumes some familiarity with machine learning and reinforcement learning concepts, and it should be accessible if you understand the basics of supervised learning with neural networks.
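To give a rough feel for the core idea before getting into details, here is a minimal sketch of the one-step temporal difference (TD(0)) update on a lookup table. It is not the code from the repository: the function name `td0_update`, the state labels `"s0"` and `"s1"`, and the values of `alpha` and `gamma` are all made up for illustration, and a table stands in for the neural network that a game as large as ultimate tic-tac-toe would need.

```python
from collections import defaultdict


def td0_update(values, state, next_state, reward,
               alpha=0.1, gamma=1.0, terminal=False):
    """Nudge V(state) toward the one-step TD target.

    The target is the reward plus the discounted value of the next state,
    or just the reward when the episode ends at this step.
    """
    if terminal:
        target = reward
    else:
        target = reward + gamma * values[next_state]
    values[state] += alpha * (target - values[state])
    return values[state]


# Hypothetical two-step episode: s0 -> s1 -> win (reward 1 at the end).
values = defaultdict(float)  # unseen states start at value 0
episode = [
    ("s0", "s1", 0.0, False),
    ("s1", None, 1.0, True),
]

# Replaying the same episode many times lets the final reward
# propagate backwards from s1 to s0.
for _ in range(100):
    for state, next_state, reward, terminal in episode:
        td0_update(values, state, next_state, reward, terminal=terminal)

print(values["s0"], values["s1"])
```

Both values move toward 1 as the episode is replayed, because each state's estimate is pulled toward the estimate of the state that follows it. In self-play the episodes would come from the agent playing against itself rather than from a fixed list.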