AI Safety Fundamentals

A Playbook for Securing AI Model Weights

BlueDot Impact

By Sella Nevo et al.

In this report, RAND researchers identify real-world attack methods that malicious actors could use to steal AI model weights. They propose a five-level security framework that AI companies could implement to defend against a range of threat actors, from amateur hackers to nation-state operations.

Source:

https://www.rand.org/pubs/research_briefs/RBA2849-1.html

A podcast by BlueDot Impact.

Learn more on the AI Safety Fundamentals website.