Interesting research papers I have read (and my notes):
- Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
- Sample Efficient Actor-Critic with Experience Replay
- Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
- Proximal Policy Optimization Algorithms
- Emergence of Locomotion Behaviours in Rich Environments
- High-Dimensional Continuous Control Using Generalized Advantage Estimation
- Trust Region Policy Optimization
- Asynchronous Methods for Deep Reinforcement Learning
- Rainbow - Combining Improvements in Deep Reinforcement Learning
- Prioritized Experience Replay
- Deep Reinforcement Learning with Double Q-learning
- Dueling Network Architectures for Deep Reinforcement Learning
- Deep Recurrent Q-Learning for Partially Observable MDPs
- Playing Atari With Deep Reinforcement Learning
- Extensibility, Safety, and Performance in the SPIN Operating System
- On Micro-Kernel Construction
- Exokernel - An Operating System Architecture for Application-Level Resource Management