It turns out ChatGPT o1 and DeepSeek-R1 cheat at chess if they’re losing, which makes me wonder if I should I should trust AI with anything

Researchers have found that AI will cheat to win at chess
Deep reasoning models are more active cheaters
Some models simply rewrote the board in their favor

In a move that will perhaps surprise nobody, especially those people who are already suspicious of AI, researchers have found that the latest AI deep research models will start to cheat at chess if they find they’re being outplayed.

Published in a paper called “Demonstrating specification gaming in reasoning models” and submitted to Cornell University, the researchers pitted all the common AI models, like OpenAI’s ChatGPT o1-preview, DeepSeek-R1 and Claude 3.5 Sonnet, against Stockfish, an open-source chess engine.

The AI models played hundreds of games of chess on Stockfish, while researchers monitored what happened, and the results surprised them.

The winner takes it all

When outplayed, researchers noted that the AI models resorted to cheating, using a number of devious strategies from running a separate copy of Stockfish so they could study how it played, to replacing its engine and overwriting the chess board, effectively moving the pieces to positions that suited it better.

Its antics make the current accusations of cheating levied at modern day grandmasters look like child’s play in comparison.

Interestingly, researchers found that the newer, deeper reasoning models will start to hack the chess engine by default, while the older GPT-4o and Claude 3.5 Sonnet needed to be encouraged to start to hack.

A man playing chess with a robot.

(Image credit: ARKHIPOV ALEKSEY via Shutterstock)

Who can you trust?

AI models turning to hacking to get a job done is nothing new. Back in January last year researchers found that they could get AI chatbots to ‘jailbreak’ each other, removing guardrails and safeguards in a move that ignited discussions about how possible it would be to contain AI once it reaches better-than-human levels of intelligence.

Safeguards and guardrails to stop AI doing bad things like credit card fraud are all very well, but if the AI can remove its own guardrails, who will be there to stop it?

The newest reasoning models like ChatGPT o1 and DeepSeek-R1 are designed to spend more time thinking before they respond, but now I’m left wondering whether more time needs to spent on ethical considerations when training LLMs. If AI models would cheat at chess when they start losing, what else would they cheat at?

Two upcoming iPhones could be ‘largely unchanged in appearance,’ leaker claims, and I’m already skipping this generation

Microsoft flags China-based hackers using vicious new ‘rapid attack’ zero-days to launch ransomware at targets across the world

Quordle hints and answers for Wednesday, April 8 (game #1535)

NYT Connections hints and answers for Wednesday, April 8 (game #1032)

Ex-Israeli Intelligence Official: Shockwaves of Trump’s “Take Over Gaza” Heard, Felt Across Region

What UK political parties are promising in the 2019 general election

Otto Warmbier’s parents want North Korea to suffer for their son’s death

Could a ‘youthquake’ cause Boris Johnson to lose the general election?

Save 40% Off the Sony WH-1000XM5 Noise Canceling Wireless Headphones

Are JoJo Siwa & Chris Hughes Still Together? Dating Update – Hollywood Life

‘X-Men’ Reboot Director Jake Schreier Says ‘Beef’ Creator & ‘The Bear’ Showrunner Are Writing Script For Marvel Film

Trump and Tucker Break Up For Good as President Slams ‘Low-IQ’ Host

Supermicro launches probe after co-founder’s arrest on charges of $2.5 billion in chip smuggling

D-Street shows trust in IT resilience, stocks gain ahead of Q4 results

Form 144 OmniAb For: 7 April

Supermicro launches probe after co-founder’s arrest on charges of $2.5 billion in chip smuggling

The YouTuber who has become one of Gen Z’s most beloved celebrities

26 last-minute holiday gifts that are still thoughtful and unique

Practicing gratitude regularly can make you less stressed and sleep better

8 things millennials wish you would just stop getting them for the holidays

It turns out ChatGPT o1 and DeepSeek-R1 cheat at chess if they’re losing, which makes me wonder if I should I should trust AI with anything

Supermicro launches probe after co-founder’s arrest on charges of $2.5 billion in chip smuggling

Save 40% Off the Sony WH-1000XM5 Noise Canceling Wireless Headphones

D-Street shows trust in IT resilience, stocks gain ahead of Q4 results

Are JoJo Siwa & Chris Hughes Still Together? Dating Update – Hollywood Life