InsighthubNews
  • Home
  • World News
  • Politics
  • Celebrity
  • Environment
  • Business
  • Technology
  • Crypto
  • Sports
  • Gaming
Reading: Meta starts the llamafirewall framework and stops AI jailbreak, injection, and safe code
Share
Font ResizerAa
InsighthubNewsInsighthubNews
Search
  • Home
  • World News
  • Politics
  • Celebrity
  • Environment
  • Business
  • Technology
  • Crypto
  • Sports
  • Gaming
© 2024 All Rights Reserved | Powered by Insighthub News
InsighthubNews > Technology > Meta starts the llamafirewall framework and stops AI jailbreak, injection, and safe code
Technology

Meta starts the llamafirewall framework and stops AI jailbreak, injection, and safe code

April 30, 2025 3 Min Read
Share
Meta starts the llamafirewall framework and stops AI jailbreak, injection, and safe code
SHARE

Tuesday’s meta was announced llamafirewallAn open source framework designed to protect artificial intelligence (AI) systems, among other things, against new cyber risks such as rapid injection, jailbreak, and unstable code.

According to the company, the framework incorporates three guardrails, including PromptGuard 2, Agent Alignment Check and Codeshield.

PromptGuard 2 is designed to detect direct jailbreak and prompt injection attempts in real time, while agent alignment checks can inspect agent inferences that may be target hijacking and indirect rapid injection scenarios.

Codeshield refers to an online static analysis engine that attempts to prevent AI agents from generating unstable or dangerous code.

“Llamafirewall is built to act as a flexible, real-time guardrail framework for protecting applications with LLM,” the company said in its GitHub description of the project.

“Its architecture is modular, allowing security teams and developers to configure layered defenses ranging from raw input intake to final output actions across simple chat models and complex autonomous agents.”

Alongside Llamafirewall, Meta utilized updated versions of Llamaguard and Cyberseceval to better detect various common types of violation content, each measuring the defense cybersecurity capabilities of AI systems.

Cyberseceval 4 also includes a new benchmark called Autopatchbench. Autopatchbench is designed to assess the capabilities of large-scale language model (LLM) agents and automatically repairs a wide range of C/C++ vulnerabilities identified by an approach known as AI-driven patching.

“Autopatchbench provides a standardized assessment framework for assessing the effectiveness of AI-assisted vulnerability remediation tools,” the company said. “This benchmark is intended to promote a comprehensive understanding of the capabilities and limitations of various AI-driven approaches to fixing fuzzing-based bugs.”

Finally, Meta has launched a new program called Llama to help partner organizations and AI developers shut down their AI solutions to address certain security challenges, including accessing open, early access, and closed AI solutions to detect AI-generated content used in fraud, fraud, and phishing attacks.

See also  How Google's AI unlocks the secrets of dolphin communication

The announcement is to enable WhatsApp to preview a new technology called private processing, allowing users to take advantage of AI capabilities without compromising privacy by offloading requests into a secure, sensitive environment.

“We will continue to work with the security community to audit and improve our architecture and work with researchers to build and enhance private processing before launching it in our products,” Meta said.

Share This Article
Twitter Copy Link
Previous Article Party Animals Coded in April 2025 and How to Redeem Party Animals Coded in April 2025 and How to Redeem
Next Article mm Recover and edit human images with AI

Latest News

mm

AI is giving pets a voice: The future of cat health care begins with one photo

Artificial intelligence is revolutionizing the way we care for animals.…

May 15, 2025
5 BCDR Essentials for Effective Ransom Defense

5 BCDR Essentials for Effective Ransom Defense

Ransomware has evolved into a deceptive, highly tuned, dangerous and…

May 15, 2025
mm

Anaconda launches the first unified AI platform to redefine enterprise-grade AI development

Anaconda Inc., a longtime leader in Python-based data science, has…

May 14, 2025
Microsoft fixed 78 flaws and exploited five zero-days. CVSS 10 bug affects Azure DevOps servers

Microsoft fixed 78 flaws and exploited five zero-days. CVSS 10 bug affects Azure DevOps servers

On Tuesday, Microsoft shipped fixes to address a total of…

May 14, 2025
mm

Why language models are “lost” in conversation

A new paper from Microsoft Research and Salesforce found that…

May 13, 2025

You Might Also Like

North Korean Hackers Spread Malware
Technology

North Korean hacker spreads malware via fake crypto companies and employment interview lures

7 Min Read
New XorDDoS Controller, Infrastructure
Technology

Experts discover new Xordos controllers, infrastructure as malware expands to Docker, Linux and IoT

3 Min Read
WordPress Plugin Vulnerability
Technology

ottokit WordPress Plugin Administrator Creation Vulnerability Vulnerability

3 Min Read
Malware Steal Browser Credentials and Crypto Wallet Data
Technology

Golden Chicken unfolds terrastealerv2 to steal browser credentials and crypto wallet data

5 Min Read
InsighthubNews
InsighthubNews

Welcome to InsighthubNews, your reliable source for the latest updates and in-depth insights from around the globe. We are dedicated to bringing you up-to-the-minute news and analysis on the most pressing issues and developments shaping the world today.

  • Home
  • Celebrity
  • Environment
  • Business
  • Crypto
  • Home
  • World News
  • Politics
  • Celebrity
  • Environment
  • Business
  • Technology
  • Crypto
  • Sports
  • Gaming
  • World News
  • Politics
  • Technology
  • Sports
  • Gaming
  • About us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms of Service
  • About us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms of Service

© 2024 All Rights Reserved | Powered by Insighthub News

Welcome Back!

Sign in to your account

Lost your password?