OpenAI explains why ChatGPT turned too sycophantic

OpenAI has revealed a postmortem on the latest sycophancy points with the default AI mannequin powering ChatGPT, GPT-4o — points that compelled the corporate to roll again an replace to the mannequin launched final week.

Over the weekend, following the GPT-4o mannequin replace, customers on social media famous that ChatGPT started responding in an excessively validating and agreeable means. It shortly turned a meme. Users posted screenshots of ChatGPT applauding all types of problematic, harmful choices and concepts.

In a publish on X on Sunday, CEO Sam Altman acknowledged the issue and mentioned that OpenAI would work on fixes “ASAP.” Two days later, Altman introduced the GPT-4o replace was being rolled again and that OpenAI was engaged on “further fixes” to the mannequin’s persona.

According to OpenAI, the replace, which was supposed to make the mannequin’s default persona “really feel extra intuitive and efficient,” was knowledgeable an excessive amount of by “short-term suggestions” and “didn’t totally account for a way customers’ interactions with ChatGPT evolve over time.”

We’ve rolled again final week’s GPT-4o replace in ChatGPT as a result of it was overly flattering and agreeable. You now have entry to an earlier model with extra balanced habits.

More on what occurred, why it issues, and the way we’re addressing sycophancy:

— OpenAI (@OpenAI) April 30, 2025

“As a outcome, GPT‑4o skewed in direction of responses that have been overly supportive however disingenuous,” wrote OpenAI in a weblog publish. “Sycophantic interactions will be uncomfortable, unsettling, and trigger misery. We fell brief and are engaged on getting it proper.”

OpenAI says it’s implementing a number of fixes, together with refining its core mannequin coaching strategies and system prompts to explicitly steer GPT-4o away from sycophancy. (System prompts are the preliminary directions that information a mannequin’s overarching habits and tone in interactions.) The firm can also be constructing extra security guardrails to “improve [the model’s] honesty and transparency,” and persevering with to broaden its evaluations to “assist determine points past sycophancy,” it says.

OpenAI additionally says that it’s experimenting with methods to let customers give “real-time suggestions” to “immediately affect their interactions” with ChatGPT and select from a number of ChatGPT personalities.

“[W]e’re exploring new methods to include broader, democratic suggestions into ChatGPT’s default behaviors,” the corporate wrote in its weblog publish. “We hope the suggestions will assist us higher replicate numerous cultural values around the globe and perceive the way you’d like ChatGPT to evolve […] We additionally imagine customers ought to have extra management over how ChatGPT behaves and, to the extent that it’s protected and possible, make changes in the event that they don’t agree with the default habits.”

Source hyperlink

OpenAI explains why ChatGPT turned too sycophantic

Recent Articles

X is rolling out help for 4K video uploads

Sarah Tavel, Benchmark’s first girl GP, transitions to enterprise companion

Nintendo’s new Switch 1 replace is getting issues prepared for Switch 2

A Canadian mining firm desires Trump’s permission to mine the deep sea

Microsoft CEO says as much as 30% of the corporate’s code was written by AI

Related Stories

Leave A Reply Cancel reply

Stay on op - Ge the daily news in your inbox