OpenAI says it’s had to protect its Atlas AI browser against some serious security threats

OpenAI says prompt injection attacks can’t be fully eliminated, only mitigated
Malicious prompts hidden in websites can trick AI browsers into exfiltrating data or installing malware
OpenAI’s rapid response loop uses adversarial training and automated discovery to harden defenses

OpenAI has claimed that while AI browsers might never be fully protected from prompt injection attacks, that doesn’t mean the industry should simply give up on the idea or admit defeat to the scammers – there are ways to harden the products.

The company published a new blog post discussing cybersecurity risks in its AI-powered browser, Atlas, in which it shared the somewhat grim outlook.

“Prompt injection, much like scams and social engineering on the web, is unlikely to ever be fully ‘solved,’” the blog reads. “But we’re optimistic that a proactive, highly responsive rapid response loop can continue to materially reduce real-world risk over time. By combining automated attack discovery with adversarial training and system-level safeguards, we can identify new attack patterns earlier, close gaps faster, and continuously raise the cost of exploitation.”

Rapid response loop

So what exactly is prompt injection, and what is this “rapid response loop” approach?

Prompt injection is a type of attack in which a malicious prompt is “injected” into the victim’s AI agent without their knowledge, or consent.

For example, an AI browser could be allowed to read all of the contents of a website. If that website is malicious (or hijacked) and contains a hidden prompt (white letters on a white background, for example), the AI might act on it without the user ever realizing anything.

That prompt could be different things, from exfiltrating sensitive files, to downloading and running malicious browser addons.

OpenAI wants to fight fire with fire, it seems. It created a bot, trained through reinforced learning, and let it be the hacker looking for ways in. It pits that bot against an AI defender who then go back and forth, trying to outwit one another. The end result is the AI defender capable of spotting most attack techniques.

The best antivirus for all budgets

Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!

And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too.

https://cdn.mos.cms.futurecdn.net/3Ek42Bm7W4No2qAL4PKvCU-970-80.jpg

Source link

watchOS fitness apps need to make better use of the Apple Watch’s incredible user interface

Fitbit users have been given more time to migrate their accounts over to Google

NYT Strands hints and answers for Sunday, February 1 (game #700)

NYT Connections hints and answers for Sunday, February 1 (game #966)

Ex-Israeli Intelligence Official: Shockwaves of Trump’s “Take Over Gaza” Heard, Felt Across Region

What UK political parties are promising in the 2019 general election

Otto Warmbier’s parents want North Korea to suffer for their son’s death

Could a ‘youthquake’ cause Boris Johnson to lose the general election?

Which Celebrity Styles Americans Copy Most in 2025: New Study

New ‘Westworld’ trailer introduces us to another dystopian tech company

What’s the point of ‘Charlie’s Angels’ without Sam Rockwell dancing?

These striking photos capture the future of human flight

Enterprise Products Partners' SWOT analysis: midstream giant's stock resilience tested

JetBlue's SWOT analysis: airline stock faces turbulence amid strategic shifts

Minnesota lawmaker killed on Saturday served with compassion, governor says

Minnesota shooting suspect told friend in text message: I might be dead soon

The YouTuber who has become one of Gen Z’s most beloved celebrities

26 last-minute holiday gifts that are still thoughtful and unique

Practicing gratitude regularly can make you less stressed and sleep better

8 things millennials wish you would just stop getting them for the holidays