Meta Launches LlamaFirewall Framework to Stop AI Jailbreaks, Injections, and Insecure Code

Meta Launches LlamaFirewall Framework to Stop AI Jailbreaks, Injections, and Insecure Code
Summary: Meta has launched LlamaFirewall, an open-source framework aimed at enhancing the cybersecurity of AI systems against threats such as prompt injection and insecure coding. This framework features three primary guardrails: PromptGuard 2, Agent Alignment Checks, and CodeShield. Additionally, Meta introduced updates to LlamaGuard and CyberSecEval, along with a new program called Llama for Defenders to assist organizations in tackling AI-related security issues.

Affected: AI systems and organizations utilizing AI technologies

Keypoints :

  • LlamaFirewall incorporates real-time detection and prevention mechanisms for various cyber threats.
  • CyberSecEval 4 introduces AutoPatchBench for evaluating AI’s ability to automatically fix coding vulnerabilities.
  • New Llama for Defenders program offers early access to AI solutions for tackling security challenges related to scams and phishing.

Source: https://thehackernews.com/2025/04/meta-launches-llamafirewall-framework.html