XSkill: Continual Learning from Experience and Skills in Multimodal Agents
By: Guanyu Jiang, Zhaochen Su, Xiaoye Qu, Yi R. (May) Fung
Published: 2026-03-13
Category: cs.AI
Abstract
This paper introduces XSkill, a dual-stream framework that enables multimodal agents to learn continually from visually grounded task-level skills and action-level experiences without explicit retraining. The approach improves agent performance by making tool use more efficient and more flexible.