RL agent playing pong banner

RL agent playing pong

5 devlogs
18h 6m 33s

Hello everyone! Recently, I have been learning RL and wanted to make a project with it, not too complex and not too simple. Ping Pong was the middle ground. Also, making the enviroment hopefully be easier than other enviroments.

This project uses AI

1, Used ChatGPT to check if the methods I am using from Tensorflow are actually real, since VS code didn’t show those in auto complete.
2. To quickly learn Gymnasium and the pong environment from it.

Sheikh Shaheed Rijwan Wuhan

Devlog 04
Hello again, Everyone! So far, I have finally started training the agent! Initially, I wanted to make to 2 agents and have them learn ping pong on their own but, switched to training only one since it was getting too difficult. I am still having difficulties with even one agent. Luckily, I found out that I had made a tiny error in understanding RL, that the neural network should have output nodes equal to the number of actions possible to be taken in that enviroment. So hopefully, after fixing this issue, it will work properly. Also, I have partially written the code to test the model out after traning. (Total reward for agent 2 in the photo was something I forgot to delete so, it isnt anything useful)

Attachment
0
Sheikh Shaheed Rijwan Wuhan

Devlog 03
Hello Everyone! so far I have fixed bugs as always, and codded the “agents”. but, now I have a issue while updating their weights. I’ll have to look through the documentation or something for the solution. But, overalll almost all the structure for this project is done except the reward function.

0
Sheikh Shaheed Rijwan Wuhan

Devlog 02
Hello everyone! It took me a while but, I finally made the enviroment for trainning the agent!!!
Even though it looks slow but, that is just because of timing in pygame for fps. In trainning it will be much faster.

0
Sheikh Shaheed Rijwan Wuhan

Devlog 01
Today I didnt do much. Just made how the enviroment will look. This is important since from this Ican get the agents starting positions, balls position and border positions. Even though I wont make a gui game for now. Since I will make a function to do it internally without gui for training the agent. So it can be faster.

Attachment
0