REINFORCEMENT LEARNING FROM D | Pangram Labs