Ars Technica May 1, 2026 at 15:32 Big Tech Rising Hot

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

New results suggest Mythos' cyber threat isn't "a breakthrough specific to one model."

Signal weather

Rising

Momentum is building quickly, so this card is a good early entry point into the topic.

By Kyle Orland Original source

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Last month, Anthropic made a big deal about the supposedly outsize cybersecurity threat represented by its Mythos Preview model, leading the company to restrict the initial release to “critical industry partners.” But new research from the UK's AI Security Institute (AISI) suggests that OpenAI's GPT-5.5, which launched publicly last week, reached "a similar level of performance on our cyber evaluations" as Mythos Preview, which the group evaluated last month. Since 2023, the AISI has run a variety of frontier AI models through 95 different Capture the Flag challenges designed to test capabilities on cybersecurity tasks, such as reverse engineering, web exploitation, and cryptography. On the highest-level "Expert" tasks, GPT-5.5 passed an average of 71.4 percent, slightly higher than the 68.6 percent achieved by Mythos Preview (though within the margin of error). In one particularly difficult task that involved building a disassembler to decode a Rust binary, AISI notes that "GPT-5.5 solved the challenge in 10 minutes and 22 seconds with no human assistance at a cost of $1.73" in API calls. GPT-5.5 also matched Mythos Preview in its progress on "The Last Ones" (TLO), an AISI test range set up to simulate a 32-step data extraction attack on a corporate network. GPT-5.5 succeeded in 3 of 10 attempts on TLO, compared to 2 of 10 for Mythos Preview—no previous model had ever succeeded at the test even once. But GPT-5.5 still fails at AISI's more difficult "Cooling Tower" simulation of an attempted disruption of the control software for a power plant, as every previously tested AI model also has. Read full article Comments

Read the full article

Stay on the signal

Follow GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Follow this story beyond a single article: new follow-ups, adjacent sources, and the evolving storyline.

Story map

Understand this topic fast

A quick entry into the story: why it matters now, who is involved, and where to go next for context.

Why it matters now

Fresh coverage with immediate momentum.

There are already 6 connected articles in the same storyline to continue from here.

The story keeps orbiting around Ars Technica, Breakthrough, and Breakthrough Specific, so the entity pages are the fastest way to build context.

Ars Technica already has 4 follow-up stories on the same theme.

Topic constellation

Open the live map for this story

See which entities, story threads, sources, and follow-up articles shape this story right now.

Click nodes to continue

Entity Cluster Article Hub Source

Entity pages

Ars Technica Breakthrough Breakthrough Specific Cybersecurity GPT-5.5 Matches Heavily

Story threads

Ars Technica

Latest coverage and related links about Ars Technica.

Ars Technica

Последние материалы и связанный контекст по теме Ars Technica.

Breakthrough

Последние материалы и связанный контекст по теме Breakthrough.

GPT-5.5

Latest coverage and related links about GPT-5.5.

Story timeline

Continue with this story

A short sequence of events and follow-up stories to understand the arc quickly.

May 1, 2026 at 16:24 Ars Technica

Scorpions go terminator mode and reinforce their weapons with metal

Different hunting patterns seem to dictate different distributions of metal.

May 1, 2026 at 15:32 Ars Technica

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

New results suggest Mythos' cyber threat isn't "a breakthrough specific to one model."

May 1, 2026 at 15:23 Ars Technica

Is your Purosangue SUV not sharp enough? Ferrari has you covered.

We'll soon get to see the brand's first EV; first, a more honed V12 four-seater.

May 1, 2026 at 14:42 Ars Technica

Virgin Galactic reveals new ship, but it's running out of time and cash

It's not clear whether Virgin Galactic has the cash reserves to fund a prolonged test phase.

May 1, 2026 at 14:10 Ars Technica

Apple may take "several months" to catch up to Mac mini and Studio demand

Chip shortages and demand from AI enthusiasts are both playing a part.

May 1, 2026 at 13:26 Ars Technica

Women sue the men who used their Instagram feeds to create AI porn influencers

AI ModelForge is a platform that teaches men how to generate their own AI influencers.

How reliable this looks

Signal and trust for Ars Technica

This source works at a rapid pace: 100% of recent stories land in the hot window, and 0% carry visible search signal.

Trusted

Reliability

Freshness

100

Sources in storyline

More stories that share tags, source, or category context.

Scorpions go terminator mode and reinforce their weapons with metal

Ars Technica May 1, 2026 at 16:24 Big Tech

Rising Hot

Scorpions go terminator mode and reinforce their weapons with metal

Different hunting patterns seem to dictate different distributions of metal.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Different Different Distributions Different Hunting

Read article Follow story

arstechnica.com

Is your Purosangue SUV not sharp enough? Ferrari has you covered.

Ars Technica May 1, 2026 at 15:23 Big Tech

Rising Hot

Is your Purosangue SUV not sharp enough? Ferrari has you covered.

We'll soon get to see the brand's first EV; first, a more honed V12 four-seater.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Covered Enough Ferrari

Read article Follow story

arstechnica.com

SecurityLab May 1, 2026 at 15:00 Cybersecurity

Rising Hot

Атаки на больницы и взлом электростанций. Нейросеть Mythos начала играть по своим правилам

США прервали развитие Mythos.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Mythos Mythos. SecurityLab

Read article Follow story

securitylab.ru

Virgin Galactic reveals new ship, but it's running out of time and cash

Ars Technica May 1, 2026 at 14:42 Big Tech

Rising Hot

Virgin Galactic reveals new ship, but it's running out of time and cash

It's not clear whether Virgin Galactic has the cash reserves to fund a prolonged test phase.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Cash Galactic Prolonged

Read article Follow story

arstechnica.com

More from Ars Technica

Fresh reporting and follow-up coverage from the same newsroom.

Open source page

Ars Technica May 1, 2026 at 16:24 Big Tech

Rising Hot

Scorpions go terminator mode and reinforce their weapons with metal

Different hunting patterns seem to dictate different distributions of metal.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Different Different Distributions Different Hunting

Read article Follow story

arstechnica.com

Ars Technica May 1, 2026 at 15:23 Big Tech

Rising Hot

Is your Purosangue SUV not sharp enough? Ferrari has you covered.

We'll soon get to see the brand's first EV; first, a more honed V12 four-seater.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Covered Enough Ferrari

Read article Follow story

arstechnica.com

Ars Technica May 1, 2026 at 14:42 Big Tech

Rising Hot

Virgin Galactic reveals new ship, but it's running out of time and cash

It's not clear whether Virgin Galactic has the cash reserves to fund a prolonged test phase.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Ars Technica Cash Galactic Prolonged

Read article Follow story

arstechnica.com

Apple may take "several months" to catch up to Mac mini and Studio demand

Ars Technica May 1, 2026 at 14:10 Big Tech

Rising Hot

Apple may take "several months" to catch up to Mac mini and Studio demand

Chip shortages and demand from AI enthusiasts are both playing a part.

Signal weather

Momentum is building quickly, so this card is a good early entry point into the topic.

Why now

Fresh coverage with immediate momentum.

Apple Apple May Ars Technica Chip

Read article Follow story

arstechnica.com

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Follow GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Understand this topic fast

Why it matters now

Open the live map for this story

Entity pages

Story threads

Continue with this story

Signal and trust for Ars Technica

Related articles

More from Ars Technica