IMCBench: A benchmark for multimodal LLMs in Image-grounded Medical Conversations
본문 미리보기
arXiv:2606.28556v1 Announce Type: new Abstract: Recent advances in large language models and vision-language models have enabled reasoning over multimodal data, offering opportunities for clinical applications such as decision support and triaging. However, existing medical AI benchmarks are fragmented: some support multi-turn dialogues but lack images, while others provide multimodal inputs but focus on single-turn QA tasks. To address this gap, we introduce IMCBench, an image-grounded, multi-
전체 내용이 궁금하다면?
원문을 직접 읽어보세요