Comparison of on-policy deep reinforcement learning A2C with off-policy DQN in irrigation optimization : a case study at a site in Portugal See original record