Related documents: Least squares temporal difference actor-critic methods with applications to robot motion control (Conference Proceeding)