Devlog 04
Hello again, Everyone! So far, I have finally started training the agent! Initially, I wanted to make to 2 agents and have them learn ping pong on their own but, switched to training only one since it was getting too difficult. I am still having difficulties with even one agent. Luckily, I found out that I had made a tiny error in understanding RL, that the neural network should have output nodes equal to the number of actions possible to be taken in that enviroment. So hopefully, after fixing this issue, it will work properly. Also, I have partially written the code to test the model out after traning. (Total reward for agent 2 in the photo was something I forgot to delete so, it isnt anything useful)