Prompt injection attacks might ‘never be properly mitigated’ UK NCSC warns

UK’s NCSC warns prompt injection attacks may never be fully mitigated due to LLM design
Unlike SQL injection, LLMs lack separation between instructions and data, making them inherently vulnerable
Developers urged to treat LLMs as “confusable deputies” and design systems that limit compromised outputs

Prompt injection attacks, meaning attempts to manipulate a large language model (LLM) by embedding hidden or malicious instructions inside user-provided content, might never be properly mitigated.

This is according to the UK’s National Cyber Security Centre’s (NCSC) Technical Director for Platforms Research, David C, who published the assessment in a blog assessing the technique. In the article, he argues that many compare prompt injection to SQL injection, which is inaccurate, since the former is fundamentally different and arguably more dangerous.

LLMs) simply do not enforce a security boundary between instructions and data inside a prompt.”

Prompt injection attacks are regularly reported in systems that use generative AI (genAI), and are the OWASP’s #1 attack to consider when ‘developing and securing generative AI and large language model applications’.

In classical vulnerabilities, data and instructions are handled differently, but LLMs operate purely on next-token prediction, meaning they cannot inherently distinguish user-supplied data from operational instructions. “There’s a good chance prompt injection will never be properly mitigated in the same way,” he added.

The NCSC official also argues that the industry is repeating the same mistakes it made in the early 2000s, when SQL injection was poorly understood, and thus widely exploited.

But, SQL injection was ultimately better understood, and new safeguards became standard. For LLMs, developers should treat them as “inherently confusable deputies”, and thus design systems that limit the consequences of compromised outputs.

If an application cannot tolerate residual risk, he warns, it may simply not be an appropriate use case for an LLM.

The best antivirus for all budgets

Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!

And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too.

https://cdn.mos.cms.futurecdn.net/DVffQnnibMWmNpx2Wfb5Se-1920-80.jpg

Source link

Google Adds New Android Controls for WhatsApp Backups, Password Transfers

Europe’s secret drone project from 2003 quietly became the foundation for nearly every major military aviation programme today

Google Adds New Android Controls for WhatsApp Backups, Password Transfers

Quote of the day by Netflix co-founder Reed Hastings: ‘Stone Age. Bronze Age. Iron Age. We define entire epics of humanity by the technology...

Ex-Israeli Intelligence Official: Shockwaves of Trump’s “Take Over Gaza” Heard, Felt Across Region

What UK political parties are promising in the 2019 general election

Otto Warmbier’s parents want North Korea to suffer for their son’s death

Could a ‘youthquake’ cause Boris Johnson to lose the general election?

All 8 Star Wars Characters Played by Jon Favreau & Dave Filoni

New Spider-Man: Brand New Day Trailer Released

How the Aesthetic Industry is Redefining Beauty Standards – Hollywood Life

The Duffer Brothers Series ‘The Boroughs’ Canceled At Netflix

Australia’s ASX proposes 25% cap on share issuance in M&A without shareholder vote

Apple prepares second-generation iPhone Air for spring 2027

Samsara director Jonathan Chadwick sells $334,723 in stock

How surging gold prices led to the biggest jump on this year’s Southeast Asia 500

The YouTuber who has become one of Gen Z’s most beloved celebrities

26 last-minute holiday gifts that are still thoughtful and unique

Practicing gratitude regularly can make you less stressed and sleep better

8 things millennials wish you would just stop getting them for the holidays

Prompt injection attacks might ‘never be properly mitigated’ UK NCSC warns

Australia’s ASX proposes 25% cap on share issuance in M&A without shareholder vote

Google Adds New Android Controls for WhatsApp Backups, Password Transfers

All 8 Star Wars Characters Played by Jon Favreau & Dave Filoni

Apple prepares second-generation iPhone Air for spring 2027