STABILIZING OFF-POLICY REINFO | Pangram Labs