Q-LEARNING WITH FINE-GRAINED | Pangram Labs