AbBiBench: A Benchmark for An | Pangram Labs