Anchor: Mitigating Artifact Drift in Agent Benchmark Generation | AIChainDay