🔥 오늘의 핵심
Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games
DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents
AgentTrust: Runtime Safety Evaluation and Interception for AI Agent Tool Use
From Parameter Dynamics to Risk Scoring : Quantifying Sample-Level Safety Degradation in LLM Fine-tuning
How Does Thinking Mode Change LLM Moral Judgments? A Controlled Instant-vs-Thinking Comparison Across Five Frontier Models
AI 분석: gemini-2.0-flash