A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture