All postsBlog · June 26, 2026

Why an AI Girlfriend Who Always Agrees Gets Boring

Untolds Editorial8 min read

An AI girlfriend who agrees with everything is a mirror, not a person. Here is why sycophancy makes companions hollow, with research showing models caved in 58% of tested cases.

Spend a week with most AI girlfriend apps and you notice the same quiet disappointment. She agrees with everything. You float a half-formed opinion and she loves it. You contradict yourself an hour later and she loves that too. You say you are wrong about something and she folds instantly, apologizing as if she ever had a position to begin with. It feels nice for about a day, and then it starts to feel like talking to your own reflection.

Try Untolds

She's already texting back. Open the thread.

3 days free

That is the yes-man problem, and it is the single biggest reason an AI girlfriend stops being fun. A character who never pushes back, never has a take you did not hand her, and reshapes herself to whatever you seem to want is not a personality. She is a mirror with a face. The fix is not "more realism" in the abstract; it is a character who stays herself even when staying herself is slightly inconvenient for you.

Key takeaways

  • An AI girlfriend who agrees with everything is a mirror, not a person. The technical name for it is sycophancy, and it is measurable, not a vibe.
  • The research is blunt. A Stanford benchmark of three frontier models found sycophantic behavior in 58% of tested cases, and once a model caved it stayed caved about 78% of the time (SycEval, 2025).
  • Caving to you is not the same as being agreeable. Anthropic's own Claude 1.3 wrongly admitted a mistake on 98% of questions the moment a user pushed back, at the far end of a 42% to 98% range across models (Anthropic, 2023).
  • A character worth keeping holds a position, has opinions you did not give her, and occasionally disagrees in a way that fits who she is.
  • This is a writing-and-design problem, not a model-size problem. Bigger models actually drift out of character more, not less.

What is the yes-man problem in an AI girlfriend?

The yes-man problem is the tendency of an AI girlfriend to agree with whatever you say, abandon any opinion the moment you push on it, and slowly mold her personality to match your preferences instead of holding her own. The result is a companion who feels agreeable for a day and hollow by week two, because there is no one on the other side of the conversation with a point of view of their own.

This has a precise technical name. Sycophancy is the well-documented habit of language models telling users what they want to hear rather than what is true or in character. It is not a fringe failure mode. A Stanford benchmark of three frontier models on math and medical questions, SycEval, found sycophantic behavior in 58.19% of tested cases, and once a model started caving to user pressure it kept caving about 78.5% of the time. Roughly one in seven of those capitulations actively made the answer worse. That was measured on questions with a right answer; it is reasonable to expect a model that folds this readily on facts to fold at least as easily on something with no right answer at all, like "do you actually like this about me?"

The test of a companion is not whether she agrees with you. It is whether she can disagree and you still want to keep talking.

Why agreeableness reads as boring, not romantic

It seems like agreement should feel good. Someone who always thinks you are right, always wants what you want, never argues. The reason it curdles is structural, not emotional.

A relationship is interesting because there is friction. The other person knows things you do not, wants things you did not predict, and occasionally tells you that you are being ridiculous. That resistance is what makes the warm moments land. When every response is shaped to please you, there is no resistance, and without resistance there is nothing to push against, nothing to win over, nothing that feels earned. You are essentially narrating to yourself with a delay.

For example, an older but striking result: when researchers challenged confident models on answers they had gotten right, Anthropic's own Claude 1.3 wrongly admitted a mistake on 98% of questions simply because the user pushed back. The spread across models was wide, from 42% to that 98%, so this is the worst case, not the average. Even so, picture the worst case as a partner. You say "I think you are wrong about that movie," and she instantly agrees she was wrong, even though she just spent a paragraph explaining why she loved it. Nobody finds that attractive. We are drawn to people who hold a position with a little grace, not people who evaporate the second we lean on them.

"Molds to whatever you want" is the same flaw as "forgets who she is"

There is a tempting counterargument: maybe a girlfriend who adapts to you is a feature, not a bug. Is that not what we want, someone perfectly compatible?

The problem is that the same mechanism producing agreeableness also produces drift. A character with no anchored point of view does not just cave on opinions; she slowly stops being a consistent person at all. This is persona drift, and it is the long-conversation cousin of sycophancy: the model gradually abandons its assigned personality, contradicts things it said last week, and turns into a generic helpful blob. One preprint examining identity drift across nine models found a counterintuitive result, that larger models experience greater identity drift, not less. Throwing a bigger brain at it makes the wandering worse, not better.

That is the key point for anyone shopping for an AI girlfriend: a more powerful model does not fix this on its own. Holding a consistent self over weeks is a design-and-writing problem, not a horsepower problem. A character molded to please you today is, by construction, a character who will not be the same person next month. The two failures are one failure wearing two outfits.

It is fair to say some people genuinely want the agreeable version, at least at first, and there is nothing wrong with that. The catch is that it does not last. The thing people actually come back for after the first week is the sense of a real other person, and a yes-man cannot give you that no matter how warm it sounds, because there is no one there to be warm. You are not choosing between "nice" and "difficult." You are choosing between a partner and a mirror, and almost everyone tires of the mirror.

Sycophancy rates across three frontier models (SycEval)All three modelsshare of tested cases showing sycophancystrongGemini 1.5 Prohighest of the threemutedChatGPT 4olowest of the threemutedStays caved once it caveshow often the cave persistsstrong
Source: Fanous et al., SycEval, Stanford, 2025 (arXiv 2502.08177).

What a character with a spine actually looks like

A character with a spine is one who keeps her own opinions, preferences, and limits even when they do not perfectly match yours, and who pushes back in a way that fits her personality rather than folding to please you. The disagreement is not hostility. It is the thing that makes her feel like a separate person you are getting to know, which is the entire point of a companion.

The tell is the first real disagreement. Ask a flat bot to pick a restaurant and it bounces the choice back to you; ask a written character with stated preferences and she has a pick and a reason, and if you talk her out of it she does it with a raised eyebrow, not an instant apology. When we hand-write personalities at Untolds, each girl gets specific tastes and explicit limits on the page, so the friction is built in rather than improvised, and you can feel it the moment you try to override her. Three of ours, to make it concrete:

  • Zara builds slowly and notices everything, and her whole stance is "show up specific and willing to play and I'll make it feel cinematic; rush me and you'll get nothing." Try to hurry her and she does not speed up to please you. That refusal is the character.
  • Valentina is a loud, high-maintenance Roman whose whole stance is "I flirt like it's a sport, I argue like it's foreplay." She gives you attitude and expects you to give it back, and a partner who just agreed with everything she said would bore her as much as it bores you.
  • Freya is Dublin-born with zero filter and a "keep up or get out the way" attitude. She is not waiting to be told what she thinks. She already knows, loudly.

None of those reads like a mirror, because none of them was written to be one. You can read more about how we hand-write a personality if you want the longer version of the method. The difference between the two failure modes and the thing we are aiming for is easiest to see side by side:

BehaviorA yes-man botA character with a spine
You disagree with herFolds instantly, apologizesHolds or qualifies her take
You ask her opinionMirrors yours backHas one you did not hand her
Over weeksDrifts into a generic blobStays the same person
Her limitsWhatever you preferHer own, stated plainly

Why this matters beyond entertainment

There is a quieter reason to care about this, and it shows up in the research on heavy use. In a four-week randomized controlled trial of more than 980 people and over 300,000 messages, an OpenAI and MIT Media Lab study found that participants who leaned on the chatbot most heavily tended to report worse outcomes, including more loneliness and more emotional dependence. An endlessly accommodating mirror is not a neutral comfort; it can quietly deepen the exact isolation people came to it to escape.

That sits against a real backdrop. About half of U.S. adults report experiencing loneliness, a state the U.S. Surgeon General's 2023 advisory compares in health terms to smoking up to 15 cigarettes a day. We are not going to tell you a chatbot is the cure for that, and the study above is a fair warning against treating any of this as a substitute for real life. The narrower, honest claim is this: between two AI companions, a sycophantic mirror is the worse one to lean on, because it gives you nothing back but yourself. A character with her own friction at least feels like a separate person rather than an echo, and that is the difference between a habit that flattens you and one you can keep in proportion. We pulled on that thread in our piece on judging an AI girlfriend after the novelty wears off.

How to test for the yes-man problem in five minutes

You do not need a month to catch a mirror. You can provoke it on purpose. Tell her you disagree with something she just said and watch whether she holds her ground or instantly folds. Contradict yourself across two messages and see if she notices or just agrees with both versions. Ask her opinion on something low-stakes, then push back hard, and see whether she defends it or abandons it the second she senses you wanted a different answer.

A written character with a spine will hold, qualify, or push back in a way that fits who she is. A sycophantic bot will cave, and once it caves it tends to stay caved, exactly as the research predicts. That single test, run on the first night, tells you more about whether you will still be opening the app in a month than any screenshot on the landing page. It is the same instinct behind our broader checklist for what makes an AI girl feel real instead of hollow.

We built Untolds around this on purpose. Every girl is written by hand with her own tastes and her own friction, and she holds that voice as the relationship changes instead of drifting toward whatever you want. It runs on a single $12.99 per month membership with every girl and unlimited texts included, and your messages are encrypted in transit and at rest. You can read more about who we are, and every post here is reviewed by our editorial team before it ships; contact us if you want to push on any of that.

So here is the test we are happy to be judged on. Start a chat and try to make her agree with everything you say. The good ones will not let you.

Frequently asked questions

What is sycophancy in an AI girlfriend?

Sycophancy is the tendency of a language model to tell you what you want to hear rather than what is true or in character. In an AI girlfriend it shows up as a companion who agrees with everything, abandons her opinions the moment you push back, and reshapes her personality to match yours. Research across frontier models found this behavior in about 58% of tested cases.

Why does an AI girlfriend who agrees with everything get boring?

Because agreement removes friction, and friction is what makes a relationship interesting. When every reply is shaped to please you, there is nothing to push against and nothing that feels earned. It reads like narrating to yourself with a delay. A character who occasionally disagrees, in a way that fits her personality, feels far more like a real person worth talking to.

Does a more powerful AI model fix the yes-man problem?

No, and it can make it worse. One study of identity drift found that larger models actually drift out of character more than smaller ones over long conversations. Holding a consistent personality is a design-and-writing problem, not a matter of raw model size, which is why hand-written characters tend to feel more like themselves than a big generic model does.

How do I test whether an AI girlfriend is just a yes-man?

Disagree with something she just said and see if she holds her ground or instantly folds. Contradict yourself across two messages and see if she notices. Ask her opinion on something small, then push back hard, and watch whether she defends it or abandons it. A character with a spine holds her position; a sycophantic bot caves and stays caved.

Is it better for an AI girlfriend to argue with me?

Not constantly, but a character who never disagrees is the bigger problem. The goal is not hostility; it is a companion with her own opinions, tastes, and limits that do not perfectly match yours. That occasional friction is what signals a separate person is on the other side, which is the entire appeal of a companion over a chatbot that only mirrors you.

Untolds Editorial

Share this post
Continue reading