I work in Deep Reinforcement Learning under the supervision of Professor Yoshua Bengio and Professor Liam Paull