FAITHCOT-BENCH: BENCHMARKING | Pangram Labs