Has anyone got any tips on improving my reinforcement learning algorithm?
pic related. I have to wait over 6 hours just to see if my hyper parameter tuning is working. If it fails, then I tweak and wait another 6 hours... very slow.
I'm using PPO btw.
pic related. I have to wait over 6 hours just to see if my hyper parameter tuning is working. If it fails, then I tweak and wait another 6 hours... very slow.
I'm using PPO btw.
