ThinkDeception: A Progressive Reinforcement Learning Framework for Interpretable Multimodal Deception Detection | AIChainDay

AI2026년 6월 18일AI 점수: 93%

ThinkDeception: A Progressive Reinforcement Learning Framework for Interpretable Multimodal Deception Detection

출처:arXiv cs.AI

✨ AI 인사이트

🧑‍💻 개발자

1.ThinkDeception: 멀티모달 기만 탐지를 이진 분류가 아닌 명시적 인지 추론 과정으로 전환
2.단계별 멀티모달 Chain-of-Thought 데이터셋을 최초 구축해 기반 모델 학습
3.표준 GRPO와 달리 시청각 일관성 기반 VAC-GRPO와 4단계 난이도 점진 학습 적용
4.주류 벤치마크에서 탐지 정확도와 근거 품질 모두 새로운 SOTA 달성

💡

왜 중요한가?

기존 블랙박스 기만 탐지가 추론 근거를 제시하지 못하던 한계를, MLLM과 단계별 CoT·일관성 보상으로 해석 가능하게 만들어 사기 의도 판별에 투명한 근거를 제공한다.

🏷️ 언급 프로젝트

ThinkDeception VAC-GRPO

본문 미리보기

arXiv:2606.18988v1 Announce Type: new Abstract: Multimodal deception detection is critical for identifying fraudulent intentions, yet existing approaches predominantly rely on end to end black--box paradigms. These methods suffer from a severe lack of interpretability failing to provide transparent reasoning trajectories and struggling to explicitly capture the subtle, cross modal inconsistencies inherent in deceptive behaviors. To transcend these limitations, we propose ThinkDeception, a novel

전체 내용이 궁금하다면?

원문을 직접 읽어보세요

원문 보기