iGRPO: SELF-FEEDBACK-DRIVEN L | Pangram Labs