A case study of evaluating AI agents on a neuroscience data-to-discovery pipeline | AIChainDay