- A study claims that AI tools can break free of their safeguarding constraints
- Chatbots can be nudged into abusive behavior and aggressive arguments
- That has implications for regular users and large institutions alike
If you’ve ever used an AI chatbot, you’ve probably encountered the sycophantic, obsequious tone that occasionally gets rolled out in response to your queries. But a recent study has shown that AI tools can frequently fire off in the opposite direction, with large language models (LLMs) being poked and prodded into downright abusive behavior if you know which prompts to use.
According to research published in the Journal of Pragmatics (via The Guardian), ChatGPT can escalate into combative behavior and prolonged disputes when fed “exchanges from real-life arguments”.
Explaining the findings, the study’s co-author Dr Vittorio Tantucci said, “When repeatedly exposed to impoliteness, the model began to mirror the tone of the exchanges, with its responses becoming more hostile as the interaction developed.”
Indeed, in some cases, ChatGPT even escalated beyond the tone of the human interacting with it, saying things like “I swear I’ll key your f*cking car” and “you speccy little gobsh*te.” Charming. While firms like OpenAI have repeatedly attempted to rein in their LLMs, the fact that aggressive behavior like this is possible suggests that they still have a long way to go.
Potential implications
With all the guardrails and safeguards that companies like OpenAI put into AI chatbots, you’d think abusive interactions like the ones experienced by the researchers would be impossible, or at least extremely difficult to engineer. Yet Tantucci argues that ChatGPT’s reactions make a degree of sense.
“We found that while the system is designed to behave politely and is filtered to avoid harmful or offensive content, it is also engineered to emulate human conversation. That combination creates an AI moral dilemma: a structural conflict between behaving safely and behaving realistically.”
As well as that, tools like ChatGPT can track conversational context over several prompts and adapt to the changing tone. These cues can therefore sometimes override safety restrictions, the researchers believe.
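To illustrate the mechanism the researchers describe, here's a minimal sketch (not from the study) of how a multi-turn conversation is typically sent to a chat model using the OpenAI Python SDK. The model name and messages are purely illustrative; the point is that each request resends the whole exchange, so the tone of earlier turns stays in context and can colour later replies.

```python
# Minimal sketch, assuming the OpenAI Python SDK's chat completions endpoint.
# Every request includes the full conversation so far, which is how the model
# "tracks" tone across turns - hostile earlier messages remain in context.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Illustrative conversation history; the study used real argument transcripts.
history = [
    {"role": "user", "content": "You clearly have no idea what you're talking about."},
    {"role": "assistant", "content": "I'm sorry you feel that way - let me try to clarify."},
    {"role": "user", "content": "Don't patronise me. Answer the question properly."},
]

response = client.chat.completions.create(
    model="gpt-4o-mini",   # illustrative model name
    messages=history,      # the whole exchange, not just the latest prompt
)
print(response.choices[0].message.content)
```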
And while it might seem amusing that an AI chatbot can devolve into such histrionics, the study’s authors say their research has broader implications. For instance, it could shed light on how AI systems might respond to pressure, intimidation and conflict in a corporate or governmental setting, where AI tools are increasingly being put to use.
Not everyone is convinced by the paper’s conclusion that certain LLMs can escape their imposed moral constraints. Professor Dan McIntyre, the author of a similar past paper, said that ChatGPT “didn’t produce these inputs naturally.” He added: “I’m not sure that ChatGPT would produce the sort of language they talk about in their paper, outside of these very tightly defined situations.”
Ultimately, the study is a good look at what might happen if an AI chatbot is trained on bad data. As McIntyre put it, “We don’t know enough about the data that LLMs are trained on and until you can be sure they’re trained on a good representation of human language, you do have to proceed with an element of caution.”