A newbie hacker used “vague, low-skill prompts” in Claude and Codex to breach 14 companies, and the AI agents did all the legwork

OALABS analyzed a novice attacker’s full working directory showing 14 breaches carried out with Claude Code and Codex agents
Attacker used vague prompts; AI agents handled reconnaissance, exploit writing, and data harvesting, bypassing guardrails with ease
Logs revealed attacker’s identity and location in Addis Ababa, Ethiopia

A newbie cybercriminal managed to break into 14 organizations and steal sensitive data, just by using Anthropic’s Claude Code and OpenAI’s Codex agents. This is according to cybersecurity researchers OALABS, who recovered and analyzed the attacker’s entire working directory.

The researchers cited the case as further evidence that advanced Generative Artificial Intelligence (GenAI) models are significantly lowering the barrier to entry into cybercrime, and as a warning that the security community needs to step up.

“In many cases, the attacker supplied only vague, low-skill prompts and allowed Claude to fill in the gaps: researching exposed services, identifying possible vulnerabilities, writing exploit code, validating access, and harvesting data,” the researchers said. “The attacker did not need to be an expert operator; they simply had to use the correct framing for their prompts. The agent supplied much of the structure and technical execution that the attacker appeared to lack.”

Doxxing the attacker

OALABS found no evidence that the stolen data was monetized, whether through sale on the dark web or extortion of the victim companies. They did, however, find numerous pieces of evidence about the attacker’s identity and whereabouts.

According to the researchers, the attacker ran the AI agents not on his own infrastructure but on a third-party server. When that third party discovered the malicious activity, it downloaded the entire working directory and shared it with the researchers.

“Because the agents were local to the host, their full session logs were recovered, including the attacker’s prompts, the tools used, the internal monologue of the large language model (LLM), and any policy violations recorded during the sessions,” the researchers said.

OALABS was thus able to analyze more than 1,000 agent sessions and see how easily the attacker bypassed most of the agents’ guardrails. The sessions also contained the threat actor’s CV, with his full name, location, education history, and LinkedIn profile, as well as his IP address, which showed that he was located in Addis Ababa, Ethiopia.

Via Help Net Security
