No.13196435
Has anyone got any tips on improving my reinforcement learning algorithm?

pic related. I have to wait over 6 hours just to see whether my hyperparameter tuning is working. If it fails, I tweak and wait another 6 hours... very slow.

I'm using PPO btw.