References

[1] Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., et al. “TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems.” Software available from tensorflow.org. 2015. ARXIV: 1603.04467.

[2] Achiam, J., Held, D., Tamar, A., and Abbeel, P. “Constrained Policy Optimization.” 2017. ARXIV: 1705.10528.

[3] AlphaStar Team. “AlphaStar: Mastering the Real-Time Strategy Game StarCraft II.” 2019. URL: https://deepmind.com/blog/alphastar-mastering-real-time-strategy-game-starcraft-ii.

[4] Amodei, D., Olah, C., Steinhardt, J., Christiano, P., Schulman, J., and Mané, D. “Concrete Problems in AI Safety.” 2016. ARXIV: 1606.06565.

[5] Anderson, H. L. “Metropolis, Monte Carlo, and the MANIAC.” ...
