Project Auto-World: Towards Automated Benchmarking of Neural Relational Reasoners
본문 미리보기
arXiv:2606.24965v1 Announce Type: new Abstract: Reasoning about relational structures remains a significant challenge for neural models, particularly when they must systematically apply learned knowledge to problem instances that are harder than those seen in training. Progress is hampered by the difficulty of evaluating such generalization, since a priori, it is rarely clear what makes an instance hard. We study how this issue can be addressed by using large language models (LLMs) to automate
전체 내용이 궁금하다면?
원문을 직접 읽어보세요