BehaviorBench: Modeling Real-World User Decisions from Behavioral Traces
본문 미리보기
arXiv:2606.02798v1 Announce Type: new Abstract: Many decision-support settings require systems that adapt to individual users, but evaluation data for this problem remain limited. Existing benchmarks for user understanding often rely on simulated users or model-generated behavior, even though recent work cautions that model-based simulations can diverge systematically from human behavior. We introduce \textsc{BehaviorBench}, a benchmark for evaluating personalized decision modeling from real-wo
전체 내용이 궁금하다면?
원문을 직접 읽어보세요