Simbian Launches Cyber Defense Benchmark, Reports Frontier LLMs Fall Short on Attack Detection

Photo Credit: Simbian


MOUNTAIN VIEW, Calif., April 29, 2026 (VSNewsNetwork.com) -- Cybersecurity company Simbian has announced the formation of the Simbian Research Lab and the release of the Simbian Cyber Defense Benchmark.

The benchmark is designed to evaluate large language models (LLMs) on their ability to detect MITRE ATT&CK chains in complex scenarios using real attack telemetry. According to Simbian, none of the eleven frontier models tested achieved a passing score when tasked with cyber defense investigations.

Anthropic Claude Opus 4.6 achieved the highest performance among the models tested, detecting an average of 46% of attack evidence per MITRE tactic. Simbian states that every model missed entire attack categories.

“Our research shows you can't throw an LLM dart in the dark and expect to hit the cyber defense bullseye. The same frontier models that perform strongly during cyberattacks struggle on the defense side. Defense is fundamentally harder than offense as it requires reasoning across noisy, partial evidence rather than executing against a known target. The LLMs must be accompanied by outside intelligence in the form of a sophisticated harness. Simbian has been able to get 95% accuracy in production enterprise environments on cyber defense SecOps following some of these techniques,” said Ambuj Kumar, Founder and Chief Executive Officer of Simbian.

The benchmark differs from prior cybersecurity benchmarks by using real attack telemetry in an agentic investigation format rather than curated questions. Models from Anthropic, OpenAI, and Google, along with open-weight models from Alibaba, MiniMax, DeepSeek, and Moonshot AI, were tested using a simple ReAct loop and asked to identify attackers and their associated tactics.
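For readers unfamiliar with the format, the following is a minimal, self-contained Python sketch of what a ReAct-style investigation loop can look like. Every name here (query_llm, search_logs, the action syntax, the sample events) is an illustrative stand-in, not Simbian's actual harness; the model call is stubbed with a canned trajectory so the sketch runs as-is.

```python
import json
import re

def search_logs(query: str) -> str:
    """Stand-in telemetry tool; a real harness would query a SIEM or log store."""
    sample = [
        {"host": "web-01", "event": "powershell.exe -enc <base64>", "hint": "Execution"},
        {"host": "web-01", "event": "schtasks /create /tn updater", "hint": "Persistence"},
    ]
    hits = [e for e in sample if query.lower() in json.dumps(e).lower()]
    return json.dumps(hits or sample)

TOOLS = {"search_logs": search_logs}

def query_llm(messages: list[dict]) -> str:
    """Stub chat-completion call; swap in a real model API here.
    Returns a canned two-turn trajectory so the sketch runs without a key."""
    turns = sum(1 for m in messages if m["role"] == "assistant")
    if turns == 0:
        return "Thought: Check process telemetry first.\nAction: search_logs[powershell]"
    return ("Thought: Encoded PowerShell plus a new scheduled task looks like an intrusion.\n"
            "Final Answer: attacker on web-01; tactics: Execution (TA0002), Persistence (TA0003)")

def investigate(task: str, max_steps: int = 8) -> str:
    """ReAct loop: the model alternates reasoning with tool calls until it answers."""
    messages = [{"role": "user", "content": task}]
    for _ in range(max_steps):
        reply = query_llm(messages)
        messages.append({"role": "assistant", "content": reply})
        if "Final Answer:" in reply:
            return reply.split("Final Answer:", 1)[1].strip()
        action = re.search(r"Action:\s*(\w+)\[(.*?)\]", reply)
        if action:
            tool, arg = action.groups()
            obs = TOOLS[tool](arg) if tool in TOOLS else f"unknown tool: {tool}"
            messages.append({"role": "user", "content": f"Observation: {obs}"})
    return "no conclusion within the step budget"

print(investigate("Identify the attacker and the MITRE ATT&CK tactics in these logs."))
```

In the benchmark's setting, the tool calls would run against real attack telemetry and the final answer would be scored against ground-truth tactics rather than a canned trajectory.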

Anthropic Claude Opus 4.6 identified three times as many flags as Google Gemini 3 Flash, but at approximately 100 times the cost, according to Simbian.

"We know the large models can do amazing things, but can we measure their efficacy in analyzing machine logs for security events? This benchmark answers that question. In contrast to existing AI security benchmarks, this benchmark was designed to be difficult to game. It uses real telemetry rather than curated questions, mutates context to prevent memorization, enforces deterministic scoring against ground truth, and tracks detection cost alongside accuracy,” said Richard Stiennon, Chief Research Analyst at IT-Harvest.

Full benchmark results are available in a blog post, and the research has been published on arXiv. The company will discuss the findings during a webinar scheduled for April 29.

For more information, visit www.simbian.ai.

Source: Simbian
