RL agent playing pong

Sheikh Shaheed Rijwan Wuhan worked on RL agent playing pong

3 months ago

4h 23m logged

Devlog 04
Hello again, Everyone! So far, I have finally started training the agent! Initially, I wanted to make to 2 agents and have them learn ping pong on their own but, switched to training only one since it was getting too difficult. I am still having difficulties with even one agent. Luckily, I found out that I had made a tiny error in understanding RL, that the neural network should have output nodes equal to the number of actions possible to be taken in that enviroment. So hopefully, after fixing this issue, it will work properly. Also, I have partially written the code to test the model out after traning. (Total reward for agent 2 in the photo was something I forgot to delete so, it isnt anything useful)

0

Log in to leave a comment

Sheikh Shaheed Rijwan Wuhan worked on RL agent playing pong

3 months ago

8h 45m logged

Devlog 03
Hello Everyone! so far I have fixed bugs as always, and codded the “agents”. but, now I have a issue while updating their weights. I’ll have to look through the documentation or something for the solution. But, overalll almost all the structure for this project is done except the reward function.

0

Log in to leave a comment

Sheikh Shaheed Rijwan Wuhan worked on RL agent playing pong

3 months ago

3h 54m logged

Devlog 02
Hello everyone! It took me a while but, I finally made the enviroment for trainning the agent!!!
Even though it looks slow but, that is just because of timing in pygame for fps. In trainning it will be much faster.

0

Log in to leave a comment

Sheikh Shaheed Rijwan Wuhan worked on RL agent playing pong

4 months ago

1h 3m logged

Devlog 01
Today I didnt do much. Just made how the enviroment will look. This is important since from this Ican get the agents starting positions, balls position and border positions. Even though I wont make a gui game for now. Since I will make a function to do it internally without gui for training the agent. So it can be faster.

1

0

Log in to leave a comment

0 Followers