内容简介:||
Acme: A research framework for reinforcement learning
|| Documentation | Agents | Examples | Paper
Acme is a library of reinforcement learning (RL) agents and agent building blocks. Acme strives to expose simple, efficient, and readable agents, that serve both as reference implementations of popular algorithms and as strong baselines, while still providing enough flexibility to do novel research. The design of Acme also attempts to provide multiple points of entry to the RL problem at differing levels of complexity.
Overview
At the highest level Acme exposes a number of agents which can be used simply as follows:
import acme # Create an environment and an actor. environment = ... actor = ... # Run the environment loop. loop = acme.EnvironmentLoop(environment, actor) loop.run()
Acme also tries to maintain this level of simplicity while either diving deeper into the agent algorithms or by using them in more complicated settings. An overview of Acme along with more detailed descriptions of its underlying components can be found by referring to the documentation .
For a quick start, take a look at the more detailed working code examples found in the examples subdirectory, which also includes a tutorial notebook to get you started. And finally, for more information on the various agent implementations available take a look at the agents subdirectory along with the README.md associated with each agent.
Installation
We support Python 3.6 and 3.7.
To install acme core:
# Install Acme core dependencies. pip install dm-acme # Install Reverb, our replay backend. pip install dm-acme[reverb]
To install dependencies for our JAX/TensorFlow-based agents:
pip install dm-acme[tf] # and/or pip install dm-acme[jax]
Finally, to install environments (gym, dm_control, bsuite):
pip install dm-acme[envs]
Citing Acme
If you use Acme in your work, please cite the accompanying technical report :
@article{hoffman2020acme,
title={Acme: A Research Framework for Distributed Reinforcement Learning},
author={Matt Hoffman and
Bobak Shahriari and
John Aslanides and
Gabriel Barth-Maron and
Feryal Behbahani and
Tamara Norman and
Abbas Abdolmaleki and
Albin Cassirer and
Fan Yang and
Kate Baumli and
Sarah Henderson and
Alex Novikov and
Sergio Gómez Colmenarejo and
Serkan Cabi and
Caglar Gulcehre and
Tom Le Paine and
Andrew Cowie and
Ziyu Wang and
Bilal Piot and
Nando de Freitas},
year={2020},
journal={arXiv preprint arXiv:2006.00979},
url={https://arxiv.org/abs/2006.00979},
}
以上就是本文的全部内容,希望对大家的学习有所帮助,也希望大家多多支持 码农网
猜你喜欢:本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们。
实战Linux编程精髓
罗宾斯 / 中国电力出版社 / 2005-7 / 59.80元
编写应用软件,特别是那些比较重要的软件,毫无疑问要涉及到系统调用。在UNIX/Linux环境下编程更是如此。要想编写优秀的软件,就必须熟悉这些系统调用的方方面面。通过阅读这本书,你能够快速地掌握这些重要技术,以构建严谨的Linux软件。全书主要分为三大部分:第一部分讨论了基本的编程问题,包括Linux编程环境、基本的文件和进程管理与操作、内存操作,还介绍了一些基本的库接口。第二部分比较深入地讨论了......一起来看看 《实战Linux编程精髓》 这本书的介绍吧!
随机密码生成器
多种字符组合密码
RGB HSV 转换
RGB HSV 互转工具