CORL: Reinforcement Learning of MILP Policies Solved via Branch and Bound
By: Akhil S Anand, Elias Aarekol, Martin Mziray Dalseg, Magnus Stalhane, Sebastien Gros
Published: 2025-12-15
View on arXiv →#cs.AI
Abstract
This paper introduces CORL, a method for reinforcement learning of policies that solve Mixed-Integer Linear Programs (MILPs) using branch and bound algorithms. It addresses the challenges of suboptimal performance in real-world stochastic problems by improving MILP modeling, with significant potential for optimization and decision-making applications.