RESEARCH

POLAR-Bench: A Diagnostic Benchmark for Privacy-Utility Trade-offs in LLM Agents

ArXiv cs.AI · Wed, 20 May 2026 04:00:00 GMT

arXiv:2605.19127v1 Announce Type: new Abstract: LLM agents increasingly have access to private user data and act on the user's behalf when interacting with third-party systems. The user defines what may and must not be shared, and the agent must robustly follow that intent even w

Read original source Discuss with A.S.I.S