**Bibliolater 📚 📜 🖋** @bibliolater@qoto.org · 2025-12-06T01:26:53Z

Bibliolater 📚 📜 🖋 @bibliolater@qoto.org

Bibliolater 📚 📜 🖋 @bibliolater@qoto.org

🖥️ **PropensityBench: Evaluating Latent Safety Risks in Large Language Models via an Agentic Approach**

"_Across open-source and proprietary frontier models, we uncover 9 alarming signs of propensity: models frequently choose high-risk tools when under pressure, despite lacking the capability to execute such actions unaided._"

Sehwag, U.M. et al. (2025) 'PropensityBench: Evaluating latent safety risks in large language models via an agentic approach,' arXiv (Cornell University) [Preprint]. https://doi.org/10.48550/arxiv.2511.20703.

#AI #ArtificialIntelligence #Technology #Tech #LLMS

Dec 06, 2025, 01:26 · · · ·

Trending now

Resources

Developers

What is Mastodon?

qoto.org

More…