《HandbookofLearningandApproximateDynamicProgramming》,作者JennieSi,AndyBarto,WarrenPowell,DonaldWunschauth.仔细阐述了自适应动态规划,很详细