Inverse_q_learning_world_model