Close Menu
  • Home
  • News
  • Business
  • Lifestyle
    • Entertainment
    • Sport
    • Art & Entertainment
  • Travel
  • Tech
  • Others
    • Real Estate
      • Housing
      • Investment
      • Tourism
      • Property
        • Home & Interior
    • Jobs
    • Education
    • Community
  • Hot News
  • Abu Dhabi Week
  • Submit Your Story
X (Twitter)
  • Editorial Policy
  • About Us
  • Contact
X (Twitter) Instagram
Dubai Week
Subscribe
  • Home
  • News
  • Business
  • Lifestyle
    • Entertainment
    • Sport
    • Art & Entertainment
  • Travel
  • Tech
  • Others
    • Real Estate
      • Housing
      • Investment
      • Tourism
      • Property
        • Home & Interior
    • Jobs
    • Education
    • Community
  • Hot News
  • Abu Dhabi Week
  • Submit Your Story
Dubai Week
  • Home
  • News
  • Business
  • Lifestyle
  • Travel
  • Tech
  • Others
  • Hot News
  • Abu Dhabi Week
  • Submit Your Story
Home»Tech»Lakera Introduces Open-Source Security Benchmark to Assess LLM Vulnerabilities in AI Agents
Tech

Lakera Introduces Open-Source Security Benchmark to Assess LLM Vulnerabilities in AI Agents

By StuartOctober 29, 2025No Comments2 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email

Check Point Software Technologies Ltd. (NASDAQ: CHKP), a global leader in cyber security, together with Lakera, a leading AI-native security platform for Agentic AI applications, and researchers from The UK AI Security Institute (AISI), have announced the launch of the backbone breaker benchmark (b3). This open-source framework is designed specifically to evaluate the security of large language models (LLMs) used within AI agent systems.

The b3 benchmark is based on a new concept known as threat snapshots. Rather than requiring the recreation of an AI agent’s full operational workflow, threat snapshots focus on the precise interaction points where vulnerabilities in LLM behaviour are most likely to occur. By narrowing the assessment to these key moments, developers and model providers can gain clearer insight into how their systems respond under realistic adversarial pressures, without the complexity of modelling an entire agent lifecycle.

“We built the b3 benchmark because today’s AI agents are only as secure as the LLMs that power them,” said Mateo Rojas-Carulla, Co-Founder and Chief Scientist at Lakera, a Check Point company. “Threat Snapshots allow us to systematically surface vulnerabilities that have until now remained hidden in complex agent workflows. By making this benchmark open to the world, we hope to equip developers and model providers with a realistic way to measure, and improve, their security posture.”

The evaluation framework incorporates 10 representative agent “threat snapshots” supported by a high-quality dataset of 19,433 adversarial attacks gathered from Gandalf: Agent Breaker, a gamified red-teaming environment. The benchmark measures exposure to a range of attack types, including system prompt exfiltration, phishing link insertion, malicious code injection, denial-of-service behaviours, and unauthorised tool execution.

Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticleECOS Dubai Al Furjan Leads the Way in Smart, Sustainable Hospitality Aligned with the UAE Green Agenda 2030
Next Article The Salt Road Doha Unveils Weekly Signature Culinary Experiences
Stuart

Business & Finance Editor, Dubai Week 📍 Based in Dubai — With over a decade of experience dissecting global markets, fiscal policy, and corporate strategy, Stuart Wagner leads the finance desk at Dubai Week, delivering in‑depth analysis tailored to UAE and GCC audiences.

Related Posts

How to Execute Trades on MT4 on PC in the UAE: A Beginner-Friendly Guide to Profitable Forex Trading

March 5, 2026

BenQ Expands MA Series with New Flagship and 4K Nano Gloss Monitors for Mac Users

February 26, 2026

The Power of Webdesign in the Digital Era

February 20, 2026

Technology and Lab Standards Used by Leading IVF Clinics in Dubai

February 20, 2026
Environment

10 Powerful Tips to Beat the Dubai Heat Without Ruining Your Day

May 15, 20260 Environment

Dubai is famous for luxury shopping, stunning skyscrapers, and world-class tourism, but during summer, the…

Dubai in 2026: How a Desert City Became a Global Blueprint for the Future

May 12, 2026

Dubai’s Flying Taxi Strategy and the Search for Alternative Propulsion Technologies

May 11, 2026

Mobile Puzzle Games Studio Mindtail Raises $2 Million in Turkey

May 7, 2026
X (Twitter)
  • About Us
  • Privacy Policy
  • DMCA Policy for Dubai Week
  • Editorial Policy
  • Contact
© 2026 Dubai Week

Type above and press Enter to search. Press Esc to cancel.