Reinforcement learning with the REINFORCE algorithm, implemented in Keras
# Policy Gradient

A minimal implementation of the stochastic policy gradient algorithm in Keras.

## Pong Agent

![pg](./assets/pg.gif)

This PG agent seems to win more frequently after about 8000 episodes. Below is the score graph.
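For readers unfamiliar with REINFORCE, the core computation can be sketched in a few lines. This is an illustrative NumPy-only sketch, not the code in `pg.py`: the function names `discounted_returns` and `reinforce_loss`, the `gamma` default, and the array shapes are all assumptions made for the example. It shows the two pieces a policy gradient agent typically needs: the discounted return for each step of an episode, and a loss that weights the log-probability of each taken action by that return.

```python
import numpy as np


def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for each step of one episode."""
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = 0.0
    # Walk backwards so each step can reuse the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def reinforce_loss(action_probs, actions, returns):
    """Negative log-probability of the taken actions, weighted by returns.

    action_probs: array of shape (T, n_actions), the policy output per step.
    actions:      length-T sequence of the action indices actually taken.
    returns:      length-T sequence of discounted returns.
    """
    action_probs = np.asarray(action_probs, dtype=np.float64)
    actions = np.asarray(actions, dtype=np.int64)
    returns = np.asarray(returns, dtype=np.float64)
    # Pick out the probability the policy assigned to each taken action.
    taken = action_probs[np.arange(len(actions)), actions]
    return -np.sum(np.log(taken) * returns)
```

Minimizing this loss with respect to the policy parameters raises the probability of actions that were followed by high returns. In a Keras model the same weighting is usually applied inside the training loss rather than computed by hand as above.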
File list

policy-gradient.tar.gz (an estimated 56 files)

policy-gradient/
  pg.py (4KB)
  LICENSE (1KB)
  assets/
    pg.gif (1.81MB)
    score.png (13KB)
  README.md (262B)
  .git/
    logs/
      HEAD (201B)