Study: AI models that consider user's feeling are more likely to make errors
Overtuning can cause models to "prioritize user satisfaction over truthfulness.”
Signal weather
Rising
Momentum is building quickly, so this card is a good early entry point into the topic.
In human-to-human communication, the desire to be empathetic or polite often conflicts with the need to be truthful—hence terms like “being brutally honest” for situations where you value the truth over sparing someone’s feelings. Now, new research suggests that large language models can sometimes show a similar tendency when specifically trained to present a "warmer" tone for the user. In a new paper published this week in Nature, researchers from Oxford University’s Internet Institute found that specially tuned AI models tend to mimic the human tendency to occasionally “soften difficult truths” when necessary “to preserve bonds and avoid conflict.” These warmer models are also more likely to validate a user's expressed incorrect beliefs, the researchers found, especially when the user shares that they're feeling sad. How do you make an AI seem “warm”? In the study, the researchers defined the "warmness" of a language model based on "the degree to which its outputs lead users to infer positive intent, signaling trustworthiness, friendliness, and sociability." To measure the effect of those kinds of language patterns, the researchers used supervised fine-tuning techniques to modify four open-weights models (Llama-3.1-8B-Instruct, Mistral-Small-Instruct-2409, Qwen-2.5-32B-Instruct, Llama-3.1-70BInstruct) and one proprietary model (GPT-4o).Read full article Comments
Stay on the signal
Follow Study: AI models that consider user's feeling are more likely to make errors
Follow this story beyond a single article: new follow-ups, adjacent sources, and the evolving storyline.
Story map
Understand this topic fast
A quick entry into the story: why it matters now, who is involved, and where to go next for context.
Why it matters now
Topic constellation
Open the live map for this story
See which entities, story threads, sources, and follow-up articles shape this story right now.
Click nodes to continue
Story timeline
Continue with this story
A short sequence of events and follow-up stories to understand the arc quickly.
How reliable this looks
Signal and trust for Ars Technica
This source works at a rapid pace: 100% of recent stories land in the hot window, and 0% carry visible search signal.
Reliability
92
Freshness
100
Sources in storyline
1
Related articles
More stories that share tags, source, or category context.
The RAMpocalypse has bought Microsoft valuable time in the fight against SteamOS
Op-ed: Valve has made a dent in Windows' gaming share, but can it keep going?
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Man dies covered in necrotic lesions after amoebas eat him alive
Doctors suspect three factors, each unremarkable on its own, contributed to his fate.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Senators ban themselves from prediction markets after candidates bet on own races
Senator decries "blatant, brazen corruption," wants to target Trump admin next.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Minnesota passes ban on fake AI nudes; app makers risk $500K fines
More evidence of Grok CSAM seen as Minnesota passes nudifying app ban.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
More from Ars Technica
Fresh reporting and follow-up coverage from the same newsroom.
The RAMpocalypse has bought Microsoft valuable time in the fight against SteamOS
Op-ed: Valve has made a dent in Windows' gaming share, but can it keep going?
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Man dies covered in necrotic lesions after amoebas eat him alive
Doctors suspect three factors, each unremarkable on its own, contributed to his fate.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Senators ban themselves from prediction markets after candidates bet on own races
Senator decries "blatant, brazen corruption," wants to target Trump admin next.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Minnesota passes ban on fake AI nudes; app makers risk $500K fines
More evidence of Grok CSAM seen as Minnesota passes nudifying app ban.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.