The Bank Said GPT-5.5 Hallucinates Less. The Benchmark Said 86%. Here’s Why They’re Both Right. | AIChainDay