Breaking Safety Paradox with Feasible Dual Policy Iteration

Published in International Conference on Learning Representations (**ICLR**), 2026