Where Instruction Hierarchy Breaks: Diagnosing and Repairing Failures in Reasoning Language Models | AIChainDay