IMProofBench: Benchmarking AI | Pangram Labs