BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search

Research Area
Publication
In Proc. of ACL 2026 Findings