PRINCIPLED RL FOR DIFFUSION L | Pangram Labs