Reinforce Agent playing Pixelcopter-PLE-v0

This is a trained model of a Reinforce (i.e., Monte Carlo Policy Gradient) agent playing Pixelcopter-PLE-v0. To learn how to use this model and train your own, see Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
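For readers new to Reinforce, the core of a Monte Carlo policy gradient update can be sketched in a few lines. This is not the training code behind this model. It is a minimal, framework-free illustration, and the function names `discounted_returns` and `reinforce_loss` and the default `gamma=0.99` are assumptions made for the example. It computes the discounted return for every step of a finished episode, then forms the loss as the negative sum of log-probabilities weighted by those returns.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float = 0.99) -> List[float]:
    """Compute G_t = r_t + gamma * G_{t+1} for each step of one episode.

    `gamma` is a hypothetical default; the value used to train this
    model is not stated in the model card.
    """
    returns: List[float] = []
    running = 0.0
    # Walk the episode backwards so each return reuses the next one.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs: List[float], returns: List[float]) -> float:
    """Negative return-weighted sum of the log-probabilities of taken actions.

    Minimizing this value raises the probability of actions that were
    followed by high returns.
    """
    return -sum(lp * g for lp, g in zip(log_probs, returns))


if __name__ == "__main__":
    episode_rewards = [1.0, 1.0, 1.0]
    print(discounted_returns(episode_rewards, gamma=0.5))
```

In a real training loop the log-probabilities would come from a policy network and the loss would be backpropagated by an autodiff library. Plain floats are used here only to keep the arithmetic visible.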

