Deep Recurrent Q-Learning for Partially Observable MDPs Deep Recurrent Q-Learning for Partially Observable MDPs. Matthew Hausknecht and Peter Stone. In AAAI Fall Symposium on Sequential Decision ...
Artificial Intelligence and Life in 2030. Peter Stone, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram ...
Multiagent Systems: A survey from a machine learning perspective. Peter Stone and Manuela Veloso. Autonomous Robots, 8(3):345–383, July 2000. @Article(MASsurvey, Author="Peter Stone and Manuela Veloso ...
Chapters from the text and a few other readings will be assigned throughout the semester, and the reading should be done before the corresponding class. Copies of the class lecture slides (in ...
A critical bottleneck limiting imitation learning in robotics is the lack ofdata. This problem is more severe in mobile manipulation, where collectingdemonstrations is harder than in stationary ...
Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.
Reasoning about Hypothetical Agent Behaviours and their Parameters. Stefano Albrecht and Peter Stone. In Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems ...