How to make Tic Tac Toe game using Reinforcement learning in Python [on hold]

172
March 01, 2018, at 4:33 PM

I googled for 40 hours, but can't move even a single step forward, this is what I have done, so far:

I made a Tic Tac Toe game where user can play against computer (But computer's move is 100% random).

I don't know, what is the first step to make it use reinforcement learning.

  • I know Programming
  • I know Python
  • I know maths
  • I know what Reinforcement Learning is
  • I am ready to learn more, but don't know from where.

I am not asking someone to write the whole code for me. But just give me some idea.

How do I give the bot reward for winning + save all current board's move for future learning.

Or just give me some link, which might help me (Not directly).

I am ready to study reinforcement machine learning for weeks or months, and then try again making this game.

Answer 1

Have you tried udemy, it has some good tutorials on gaming.

Rent Charter Buses Company
READ ALSO
Reconnecting a Bluetooth device using socket library (RFCOMM mode) in python 3?

Reconnecting a Bluetooth device using socket library (RFCOMM mode) in python 3?

I am trying to connect to a Bluetooth GPS unit from a Raspberry Pi3 using the socket library in python 3I am able to connect and get data flowing the first time but if I disconnect and then try reconnecting I get:

307
I heard “support function” method using in stack overflow [on hold]

I heard “support function” method using in stack overflow [on hold]

I heard "support function" method using the stack overflowMy question is "What is support function and how could we use support function?" Give an example to use "support function"

174
Producing combinations of lambda functions compositions

Producing combinations of lambda functions compositions

I am facing a challenging issue in order to make my Python3 code more elegant

183
Sort a list based on the sort indexes of another list [duplicate]

Sort a list based on the sort indexes of another list [duplicate]

This question already has an answer here:

237