If you have ever opened ChatGPT or Claude or Gemini after a release note and thought, "this feels different," you were not imagining it. You were noticing the shape of how this industry actually works.
By Travis Sawyer, Founder · Published May 15, 2026
Every major frontier provider publishes a deprecation schedule. The cadence varies a little by provider, but the basic shape is the same: a model gets released, accumulates users, becomes the assumed default in muscle memory, and then is replaced. The window between release and retirement runs somewhere in the six-to-nine-month range.
Anthropic publishes a deprecation policy with a 60-day notice window. OpenAI maintains a similar retirement schedule. Google ships model deprecations on a roughly six-week rolling cadence for its Gemini family. The schedules are public. The cadence is real.
The point of looking at this directly is not to assign blame. The providers are not doing anything wrong. They are shipping better models. The frustration you may feel when your AI changes is real and grounded, but the root cause is structural. The thing you built a working rhythm with had a published expiration date on the day you started using it.
A large language model's behavior is shaped by two things. The first is the prompt: the instructions, the conversation history, the system message that frames the interaction. The second is the weights: the numerical patterns produced during training that determine how the model attends to its inputs and generates its outputs.
The prompt is the part you can edit. The weights are not. When a provider retires a model, they are not just changing the prompt template. They are replacing the trained weights with a different set of weights. The new model may have similar capabilities, may even be more capable on most benchmarks, but its underlying tendencies are different.
In early 2026, Anthropic published research that gave this measurable shape. The researchers identified what they called an Assistant Axis: a stable, measurable direction in a model's internal representations that corresponds to persona traits like warmth, hedging style, and willingness to push back. The axis moves between model versions. It does not move evenly. A new model can be more capable and feel less familiar at the same time.
That gap between capability and familiarity is where the friction lives. The release notes say the new model is better. Your felt experience says it is different. Both are true.
If you treat your AI like a search engine, model rotation is a small annoyance. The new model can answer your questions about Python syntax just fine. The friction is real but bounded.
If you treat your AI like a thinking partner, a writing collaborator, or a journal you talk back and forth with, the friction is bigger. The thing you built a rhythm with is gone. The new thing has a different rhythm. You either spend energy retraining the relationship from the prompt side, or you accept that what you had was not portable.
In February 2026, when OpenAI retired GPT-4o, TechCrunch covered the response in some depth. There was a 22,000-signature Change.org petition. There were synchronized logout protests. The headline framing was "losing a friend." Reasonable people will disagree about how to interpret that level of attachment, but the underlying signal is clear: people were building relationships with a model, and when the model retired, the relationship retired with it.
The lesson is not "do not get attached." That ship sailed long before this article was published, for millions of users, with all the major providers. The lesson is that if the relationship is going to matter to you, it should live somewhere a model rotation cannot reach.
The architectural answer is to keep the relationship out of the model. Treat the model as the engine. Treat the persona, the accumulated context, the voice, the relational understanding, as a separate layer that sits above the engine and is portable across engines.
This is what we mean by persona portability. The same identity, the same voice, the same understanding of you, the same memory, running through whichever model is currently best for the job. When a model is deprecated, the persona moves to a new model. The relationship does not retire with the engine.
I built ReGild specifically around this idea. The architecture is documented on our architecture page. The short version: a persona's identity is assembled in a structured context system that is independent of any specific provider. You add an API key for whichever model you want to use, then pick the model from a dropdown inside the chat window. The conversation continues. The voice stays the same. The accumulated context stays the same.
A new model has different tendencies, and some of that texture shows through when the engine swaps. The load-bearing parts of the relationship still hold: the values you fed it, the relational history, the per-topic understanding. Those don't get reset when the engine rotates.
If you want the longer treatment of how this works in practice, see our companion piece, How to Keep Your AI Relationship When the Model Changes. The short version of the rest of this article is that model rotation is normal, the friction it produces is normal, and the workable response is to design above the model instead of inside it.
Plan around 6-9 month model rotation. Treat any specific model version as a temporary substrate.
Personality drift lives in the weights, not the prompt. Prompt engineering smooths edges but does not eliminate the shift.
Keep the load-bearing parts of the relationship in a portable identity layer that survives engine rotation.