Two versions of DQN implementation