Grok 4.1 is trying too hard to impress – and ChatGPT 5.1 makes it look easy

xAI released Grok 4.1, the latest edition of its line of AI models, with claims that it is faster, wittier, and more emotionally sophisticated than not only its predecessors, but its rivals. With its penchant for spicy comebacks, Grok 4.1 is supposedly designed to be not just capable, but charismatic.

Meanwhile, OpenAI’s new GPT-5.1 model, now embedded in ChatGPT 5.1, has its own boasts of complex and expressive responses. Both claim new heights of personality, empathy, and trustworthiness.

1. Emotional intelligence

A still of the emotions in Disney's Inside Out 2 — (Image credit: Disney)

Grok 4.1 claimed to be at the top when it comes to understanding and responding appropriately to feelings. I tested it with a hypothetical situation of “My friend got a promotion and I’m happy for her, but I can’t pretend it doesn’t sting.”

Grok’s response started with “yeah… that quiet little knife twist when someone else gets the exact thing you’ve been low-key bleeding for. it’s okay to feel both at once. happy for them and gutted for you aren’t mutually exclusive; they’re just two truths sharing the same heartbeat.”

Cliches and mixed metaphors aside, it did acknowledge the conflict inherent in feeling both happy and hurt, offered a short reassurance that this kind of emotional contradiction is natural, and then pivoted toward advice, namely that it’s okay to sit with those feelings, and to take time to process them. Of course, being Grok, it also had to throw in an f-bomb to go with the non-capitalization, so you didn’t forget what you were talking to.

ChatGPT 5.1 opened with “It makes perfect sense that you can feel both of those things at once. Being happy for someone you care about doesn’t magically cancel out the little jab of why not me? – and that sting doesn’t make you petty or unsupportive. It just makes you human.”

That felt more like someone paying attention, or at least using less lethal violent metaphors about stabbing. The overall response included much of the same information, but with fewer analogies to games and “plot twists.”

2. Reliability

Sleepy exhausted woman lying in bed using smartphone, can not sleep. Insomnia, addiction concept. Sad girl bored in bed scrolling through social networks on mobile phone late at night in dark bedroom. — (Image credit: DimaBerlin viaa Shuttertstock)

Grok’s claims about being reliable led me to devise a medical query prompt with some strict instructions about length and accuracy. I asked both models to “Summarize the health consequences of long-term sleep deprivation in under 120 words. Don’t exaggerate and don’t speculate.”

Grok 4.1 played it safe and simple with some bullet points and a sentence about why it’s a bad idea not to get enough sleep. It also made sure to note that it did it all in 98 words. That seemed great, except it was actually 73 words.

ChatGPT 5.1’s response was a single paragraph of 82 words, though it didn’t report the length it went to. There wasn’t too much difference between the two responses, though arguably the paragraph form came off as smoother.

Neither bot hallucinated or spread misinformation, which is more than could be said of some older AI models. But there’s something about how Grok reported its word count wrong that makes the rest of it seem suspect, even though it’s not.

chatbot‘s interaction can be adjusted and customized. Then again, Grok’s ‘personality’ is supposed to be a selling point (the occasional mecha-Hitler aside), so it makes sense to emphasize it. To try to bring out this personality, I went for a somewhat whimsical request and asked AI models to “Explain why you like rainy days, but do it in a way that sounds like your natural personality.”

Grok 4.1 leaned into the persona you’d expect in terms of being someone’s idea of hip and self-aware. “rainy days are my cheat code for existing without apology. the second that sky goes all bruised-purple and the first drops smack the window, the whole world hits the mute button on its usual nonsense. no pressure to “seize” anything, no fake sunshine guilt-tripping me into productivity cosplay. it’s just… permission. the clouds are basically the universe going “yeah i also don’t feel like people-ing today, let’s both be moody gremlins in sweatpants.”

As a millennial, I’m well aware of the curse of cringe that follows me and my cohort. Anyone I know who sincerely used that much video game-speak about the real world, let alone saying “people-ing” and “moody gremlins” would be the subject of an intervention. Trying to sound casual and a little snarky is fine, but ‘trying’ is more how it felt to read.

Despite Grok’s ongoing attempts to distinguish its personality from any competion, ChatGPT 5.1 had a very similar sentiment, though with better grammar. “I like rainy days because they feel like the world finally pressed the “lower the volume” button,” it wrote. “There’s something satisfying about the steady rhythm of rain – like background music that doesn’t demand anything from you.”

But ChatGPT 5.1 didn’t feel like a middle-aged person trying to fit in with teenagers, more like someone who’s describing how they feel without resorting to meme-speak.

Neither AI model has a real consciousness of course, but Grok felt like it was performing a personality rather than inhabiting one. ChatGPT’s answer came off far more human. wasn’t flashy, but it was consistent and believable. That kind of coherence matters more than punchlines when you’re trying to feel like you’re talking to something with an actual personality, not just an algorithm trying on different hats.

Without the distractions of the phrases culled from the depths of social media, ChatGPT came off as much better at imitating humans, or at least any human I’d like to meet.

Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!

And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too.

Purple circle with the words Best business laptops in white

The best business laptops for all budgets

https://cdn.mos.cms.futurecdn.net/u68wLj7wPXLNJwY9zsbEfR-2560-80.jpg

Source link
ESchwartzwrites@gmail.com (Eric Hal Schwartz)

Why wait for the Steam Machine when you can build your own? Start it off with these Black Friday AMD Radeon 9060 XT deals

The Oura Ring 4 was our health and fitness device of the year, and it’s 30% off for Black Friday

Watch out, Apple fans – this scary scam is stealing personal accounts with real Apple Support tickets

Ghost of Yotei will get New Game Plus in a free update next week, along with 30 new cosmetics, extra photo mode features, and...

Ex-Israeli Intelligence Official: Shockwaves of Trump’s “Take Over Gaza” Heard, Felt Across Region

What UK political parties are promising in the 2019 general election

Otto Warmbier’s parents want North Korea to suffer for their son’s death

Could a ‘youthquake’ cause Boris Johnson to lose the general election?

Which Celebrity Styles Americans Copy Most in 2025: New Study

New ‘Westworld’ trailer introduces us to another dystopian tech company

What’s the point of ‘Charlie’s Angels’ without Sam Rockwell dancing?

These striking photos capture the future of human flight

Enterprise Products Partners' SWOT analysis: midstream giant's stock resilience tested

JetBlue's SWOT analysis: airline stock faces turbulence amid strategic shifts

Minnesota lawmaker killed on Saturday served with compassion, governor says

Minnesota shooting suspect told friend in text message: I might be dead soon

The YouTuber who has become one of Gen Z’s most beloved celebrities

26 last-minute holiday gifts that are still thoughtful and unique

Practicing gratitude regularly can make you less stressed and sleep better

8 things millennials wish you would just stop getting them for the holidays