BayesBench: Evaluating LLM Belief Trajectories Under Multi-Turn Evidence Accumulation | AIChainDay