REFINEBENCH: EVALUATING REFIN | Pangram Labs