# Deep Reinforcement Learning with a Natural Language Action Space (@ ACL 2016)

### Ji He, Jianshu Chen, Xiaodong He, Jianfeng Gao, Lihong Li, Li Deng and Mari Ostendorf

This paper studies reinforcement learning in environments where both states and actions are expressed in unbounded natural language. The authors evaluate their architectures on text-based games. The proposed architecture is intuitive: separate encoders embed the state and the action, and the dot product of the two embeddings estimates the $Q$-value of the state-action pair. Training this way pushes the encoders to align states with good actions in the embedding space.
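
To make the dot-product idea concrete, here is a minimal sketch of the scoring step only, not the authors' implementation. The hashed bag-of-words features, the single `tanh` layer, the random untrained weights, and all names and sizes (`VOCAB_SIZE`, `EMBED_DIM`, `make_encoder`, `q_value`, `greedy_action`) are assumptions chosen for illustration. Training, for example with a Q-learning loss, is omitted.

```python
import math
import random

VOCAB_SIZE = 64  # hypothetical size of the hashed vocabulary
EMBED_DIM = 8    # hypothetical embedding dimension


def bag_of_words(text, vocab_size=VOCAB_SIZE):
    """Map text to a fixed-size count vector with a simple deterministic hash."""
    vec = [0.0] * vocab_size
    for token in text.lower().split():
        idx = sum(ord(ch) for ch in token) % vocab_size
        vec[idx] += 1.0
    return vec


def make_encoder(rng, in_dim=VOCAB_SIZE, out_dim=EMBED_DIM):
    """Return a randomly initialised weight matrix of shape (out_dim, in_dim)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(out_dim)]


def encode(weights, text):
    """Embed text with one linear layer followed by tanh (an assumed encoder)."""
    x = bag_of_words(text)
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in weights]


def q_value(state_enc, action_enc, state_text, action_text):
    """Estimate Q(s, a) as the dot product of the state and action embeddings."""
    s = encode(state_enc, state_text)
    a = encode(action_enc, action_text)
    return sum(si * ai for si, ai in zip(s, a))


def greedy_action(state_enc, action_enc, state_text, candidate_actions):
    """Pick the candidate action with the highest estimated Q-value."""
    return max(
        candidate_actions,
        key=lambda act: q_value(state_enc, action_enc, state_text, act),
    )


# Usage: two distinct encoders, one for states and one for actions.
rng = random.Random(0)
state_encoder = make_encoder(rng)
action_encoder = make_encoder(rng)

state = "You are in a dark room. A door leads north."
actions = ["open the door", "go back to sleep", "light the lamp"]

scores = {a: q_value(state_encoder, action_encoder, state, a) for a in actions}
best = greedy_action(state_encoder, action_encoder, state, actions)
print(scores)
print(best)
```

Because each action is encoded independently of the others, the set of candidate actions can change from state to state, which is the property that makes this design suit a natural language action space.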