Close Menu
  • Home
  • News
  • Business
  • Lifestyle
    • Entertainment
    • Sport
    • Art & Entertainment
  • Travel
  • Tech
  • Others
    • Real Estate
      • Housing
      • Investment
      • Tourism
      • Property
        • Home & Interior
    • Jobs
    • Education
    • Community
  • Hot News
  • Abu Dhabi Week
  • Submit Your Story
X (Twitter)
  • Editorial Policy
  • About Us
  • Contact
X (Twitter) Instagram
Dubai Week
Subscribe
  • Home
  • News
  • Business
  • Lifestyle
    • Entertainment
    • Sport
    • Art & Entertainment
  • Travel
  • Tech
  • Others
    • Real Estate
      • Housing
      • Investment
      • Tourism
      • Property
        • Home & Interior
    • Jobs
    • Education
    • Community
  • Hot News
  • Abu Dhabi Week
  • Submit Your Story
Dubai Week
  • Home
  • News
  • Business
  • Lifestyle
  • Travel
  • Tech
  • Others
  • Hot News
  • Abu Dhabi Week
  • Submit Your Story
Home»Tech»Lakera Introduces Open-Source Security Benchmark to Assess LLM Vulnerabilities in AI Agents
Tech

Lakera Introduces Open-Source Security Benchmark to Assess LLM Vulnerabilities in AI Agents

By StuartOctober 29, 2025No Comments2 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email

Check Point Software Technologies Ltd. (NASDAQ: CHKP), a global leader in cyber security, together with Lakera, a leading AI-native security platform for Agentic AI applications, and researchers from The UK AI Security Institute (AISI), have announced the launch of the backbone breaker benchmark (b3). This open-source framework is designed specifically to evaluate the security of large language models (LLMs) used within AI agent systems.

The b3 benchmark is based on a new concept known as threat snapshots. Rather than requiring the recreation of an AI agent’s full operational workflow, threat snapshots focus on the precise interaction points where vulnerabilities in LLM behaviour are most likely to occur. By narrowing the assessment to these key moments, developers and model providers can gain clearer insight into how their systems respond under realistic adversarial pressures, without the complexity of modelling an entire agent lifecycle.

“We built the b3 benchmark because today’s AI agents are only as secure as the LLMs that power them,” said Mateo Rojas-Carulla, Co-Founder and Chief Scientist at Lakera, a Check Point company. “Threat Snapshots allow us to systematically surface vulnerabilities that have until now remained hidden in complex agent workflows. By making this benchmark open to the world, we hope to equip developers and model providers with a realistic way to measure, and improve, their security posture.”

The evaluation framework incorporates 10 representative agent “threat snapshots” supported by a high-quality dataset of 19,433 adversarial attacks gathered from Gandalf: Agent Breaker, a gamified red-teaming environment. The benchmark measures exposure to a range of attack types, including system prompt exfiltration, phishing link insertion, malicious code injection, denial-of-service behaviours, and unauthorised tool execution.

Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticleECOS Dubai Al Furjan Leads the Way in Smart, Sustainable Hospitality Aligned with the UAE Green Agenda 2030
Next Article The Salt Road Doha Unveils Weekly Signature Culinary Experiences
Stuart

Business & Finance Editor, Dubai Week 📍 Based in Dubai — With over a decade of experience dissecting global markets, fiscal policy, and corporate strategy, Stuart Wagner leads the finance desk at Dubai Week, delivering in‑depth analysis tailored to UAE and GCC audiences.

Related Posts

How to Execute Trades on MT4 on PC in the UAE: A Beginner-Friendly Guide to Profitable Forex Trading

March 5, 2026

BenQ Expands MA Series with New Flagship and 4K Nano Gloss Monitors for Mac Users

February 26, 2026

The Power of Webdesign in the Digital Era

February 20, 2026

Technology and Lab Standards Used by Leading IVF Clinics in Dubai

February 20, 2026
News

Jaguar Land Rover and Chery Unveil Freelander Revival at Shanghai Investment Summit

June 13, 20260 News

Senior officials from Abu Dhabi gathered at The Peninsula Shanghai on 11th June for a…

Two-Week Turnaround: How Kwik Payments Plans to Rewire African E-Commerce

June 13, 2026

Twelve luxury homes sold daily as Dubai notches AED5 billion May

June 12, 2026

Eight hours reduced to five minutes: Papua New Guinea defence fund overhauls reconciliation

June 12, 2026
X (Twitter)
  • About Us
  • Privacy Policy
  • DMCA Policy for Dubai Week
  • Editorial Policy
  • Contact
© 2026 Dubai Week

Type above and press Enter to search. Press Esc to cancel.