NormAct: A Benchmark for Hidden Social Norm Compliance in Embodied Planning | AIChainDay

AI2026년 6월 29일AI 점수: 98%

NormAct: A Benchmark for Hidden Social Norm Compliance in Embodied Planning

출처:arXiv cs.AI

✨ AI 인사이트

🧑‍💻 개발자

1.체화 플래닝의 숨은 사회규범 준수를 평가하는 벤치마크 NormAct 공개
2.GPT-5.4·Claude Opus 4.7·Gemini 3 Pro: 명시 목표 67.3% vs 규범 준수 26.4%
3.격차는 지식 부족이 아닌 맥락 내 규범 활성화·접지 실패에서 기인
4.사전 규범 단서 생성기 NormPerceptor로 과제 성공률 24.2%→46.7% 향상

💡

왜 중요한가?

최신 MLLM들이 명시적 목표는 67.3% 달성해도 암묵적 사회규범은 26.4%만 지킨다는 격차를 정량화하고, 사전 규범 단서 생성만으로 성공률을 두 배 가까이 끌어올려 체화 에이전트의 규범 접지 중요성을 보였다.

🏷️ 언급 프로젝트

NormAct NormPerceptor GPT-5.4

본문 미리보기

arXiv:2606.27826v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) are increasingly deployed as embodied planners in egocentric environments, where task success requires not only achieving instructed goals but also acting in socially appropriate ways. While explicit goals may render certain actions optimal, implicit social norms often impose hidden constraints. Existing evaluations typically focus on explicit goal achievement or direct norm knowledge, seldom assessing whet

전체 내용이 궁금하다면?

원문을 직접 읽어보세요

원문 보기

#MLLM#임베디드 AI#사회 규범#AI 행동#AI 안전

3시간 전

AI-Model Network: Concept, Current State and Future

arXiv:2606. 27382v1 Announce Type: new Abstract: While the primary function of computers lies in computation and processing, the core value of the Internet is rooted in sharing and collaboration. Computers create the Internet, and the Internet empowers the value of computers. The rapid development o

📰미디어arXiv cs.AI

원문

NormAct: A Benchmark for Hidden Social Norm Compliance in Embodied Planning

본문 미리보기

관련 글

AI-Model Network: Concept, Current State and Future

When Does Personality Composition Matter for Multi-Agent LLM Teams?

Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning

Odyssey: Constructing Verifiable Local Truth-Preserving Foundation Models