MER-R1：通过慢速思维协同进行多模式情感推理

arXiv:2606.27652v1 公告类型：新摘要：我们发现显式推理不一定会转化为更好的多模态情感识别 (MER) 准确性，尽管它使预测更容易解释。具体来说，对于基于推理的 MLLM，通过触发直接答案的快速思考通常胜过深思熟虑推理后的缓慢思考。我们的实证分析表明，快速思考可以通过更广泛、更自信的预测来提高回忆能力，而缓慢思考则可以通过保守地过滤不正确的类别来提高准确性。 Building on these insights, we propose MER-R1, a reinforcement learning framework that turns slow-fast complementarity into explicit optimization. Dual-objective disentanglement separates recall and precision into two optimization signals, allowing them to be jointly optimized rather than traded off against each other. Slow-fast confidence calibration further aligns the final slow-thinking answer with fast-thinking intuition, strengthening correct emotions while suppressing incorrect ones.通过这种方式，MER-R1将快速思维的以回忆为导向的直觉与慢速思维以精确为导向的选择性结合起来。我们进一步为这种协同作用提供了理论依据，表明它减轻了优化过程中方差引起的干扰。在 MER-UniBench 和 MME-Emotion 上进行的大量实验表明，MER-R1 实现了最先进的性能，并使推理真正有利于情感识别。