CSCI 316 Problem Set #3 – Simon D. Levy

Assignment #6

Due on GitHub 23:59 Wednesday 20 September

The sweep of the pendulum had increased in extent by nearly a yard. As a natural consequence, its velocity was also much greater.

– E.A. Poe, The Pit and the Pendulum

Objectives

1. Expand our Deep Reinforcement Learning toolkit to enable continuous control for robotics.
2. Improve our understanding of Actor/Critic methods.
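
To make the first objective concrete, here is a minimal sketch of how a continuous-control policy can choose an action. Instead of picking one of a few discrete moves, the actor outputs the mean and log standard deviation of a Gaussian, and the action is sampled from it. The names sample_action and MAX_TORQUE are hypothetical and are not taken from the course code. The torque limit of 2.0 is an assumption based on the standard Pendulum task. The sketch uses only the Python standard library.

```python
import math
import random

# Assumption: torque limit of the standard Pendulum task; adjust for your environment.
MAX_TORQUE = 2.0


def sample_action(mean, log_std, rng):
    """Sample a torque from a Gaussian policy.

    Returns the clipped action and the log-probability of the unclipped sample.
    """
    std = math.exp(log_std)
    raw = rng.gauss(mean, std)
    # Log-density of the raw sample under N(mean, std^2)
    log_prob = (
        -0.5 * ((raw - mean) / std) ** 2
        - log_std
        - 0.5 * math.log(2.0 * math.pi)
    )
    # Clip so the environment never receives an out-of-range torque
    action = max(-MAX_TORQUE, min(MAX_TORQUE, raw))
    return action, log_prob


rng = random.Random(0)  # fixed seed for repeatability
action, log_prob = sample_action(mean=0.5, log_std=-0.5, rng=rng)
print(action, log_prob)
```

In an Actor/Critic method, the log-probability returned here is the quantity the actor's loss is usually built on, while the critic supplies the value estimate that weights it.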

Running on a workstation in the Advanced Lab (Parmly 413)

Because of the computational power needed for DRL, you will want to do this assignment on a computer with a GPU. Here’s what you need to do to get started:

USE YOUR ppo.py to have students write pendulum_train.py, pendulum_test.py