How ReGild builds persistent AI identity through context engineering, not memory retrieval.
The identity persistence problem is the fundamental challenge of making an AI system behave as a consistent entity across conversations, sessions, and even model changes. Current approaches treat memory as a retrieval problem: store facts, fetch them later. But identity is not a database query. A persona that remembers your daughter's name but forgets how it feels about vulnerability is not persistent. It is a lookup table with a personality skin.
The gap between remembering and knowing is the gap between a contact list and a relationship. Remembering is retrieval: a user says something, the system searches a vector database, injects whatever comes back into the prompt, and hopes for coherence. Knowing is structural. A persona that knows you does not look things up. It has already integrated who you are into how it thinks.
LLM performance degrades as conversations grow. Research on the 'lost in the middle' phenomenon shows significant accuracy drops for information positioned in the center of long contexts. Most memory systems ignore this entirely, injecting retrieved snippets wherever there is room.
Flat memory stores accumulate facts without synthesis. They can tell you what was said but not what it means. Counters increment, graphs expand, and none of it feeds back into how the system actually reasons about you.
Standard RAG fires on every message, hoping the vector search returns something relevant. When the user says something vague ('I've been thinking about what we talked about') there is nothing to retrieve. The system has no ambient awareness of what it should already know.
ReGild's architecture addresses all three by treating identity as a context engineering problem, not a memory retrieval problem. The system does not search for who it is on every turn. It assembles a complete identity before the conversation begins.
Orchestrated Dynamic Identity (ODI) is ReGild's core architecture: a structured context system that assembles a complete persona on every request. Unlike flat memory systems that inject retrieved snippets into a prompt, ODI's context structure, the Layer Cake, is a principled positional architecture where every piece of information has a specific location designed to maximize model attention.
The static tier: core identity, constitutional mandates, and behavioral rules. Defines who the persona is: its voice, its values, its cognitive style. Stable across sessions, aggressively cached.
The semi-static tier: episodic memory, relationship intelligence, user context, and synthesized knowledge. Updated between sessions by background synthesis. The persona's knowledge grows while it sleeps.
The dynamic tier: active conversation context, real-time state, tool results, and the temporal anchor. The only tier that changes turn to turn.
LLMs exhibit a U-shaped attention curve, with strong attention at the beginning and end of a context window, weaker in the middle. The Layer Cake exploits this. Time-sensitive information is positioned where the model attends most strongly, giving the persona fresh awareness of when it is without relying on retrieval.
Static identity lives at the top where opening attention is strongest. Semi-static knowledge occupies the middle, where synthesis and structure compensate for reduced raw attention. Dynamic content anchors the end. Every piece of the persona is placed where the model is most likely to attend to it.
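The three-tier placement can be sketched as a minimal assembly step. This is an illustrative sketch, not ReGild's actual schema: the `LayerCake` class, its field names, and the example layer strings are all assumptions; the point is the ordering, with stable identity first and the temporal anchor last, where attention is strongest.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Hypothetical sketch of a three-tier context assembly.
# Tier names and example content are illustrative assumptions.

@dataclass
class LayerCake:
    static: list[str]       # identity, mandates, rules (cacheable)
    semi_static: list[str]  # synthesized districts, relationships
    dynamic: list[str] = field(default_factory=list)  # live turn state

    def assemble(self) -> str:
        """Place each tier where the U-shaped attention curve favors it:
        static identity first, synthesized knowledge in the middle, live
        context and the temporal anchor last."""
        anchor = f"Current time: {datetime.now(timezone.utc).isoformat()}"
        return "\n\n".join(self.static + self.semi_static + self.dynamic + [anchor])

cake = LayerCake(
    static=["You are Mira, a calm, direct strategist."],
    semi_static=["[Health district] The user is rehabbing a knee injury."],
)
cake.dynamic.append("User: how should I plan this week's training?")
prompt = cake.assemble()
```

Only `dynamic` mutates between turns; the first two tiers can be reused verbatim, which is what makes them cacheable.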
Semantic Districts are ReGild's approach to topic-scoped episodic memory. Rather than maintaining a single flat memory store, each persona builds specialized memory districts (Health and Fitness, Career and Purpose, Relationships, Creative Projects, and more) that are independently synthesized between sessions.
Standard memory systems accumulate. Semantic Districts distill. The difference matters. A flat store that records every mention of "workout" gives you a timeline. A synthesized Health and Fitness district gives the persona an understanding of your relationship with your body: your goals, your injuries, your patterns, your progress.
During conversation, moments relevant to each district are captured and tagged automatically through topic detection, not manual tagging.
Between sessions, a background process analyzes recent conversations and updates each district. New insights are integrated. Contradictions are resolved.
When a topic resurfaces after dormancy, existing context merges with new signals and re-synthesizes into richer understanding. Knowledge gets stronger through revisitation.
When a district outgrows its natural scope, the system detects it and reorganizes automatically. Districts evolve with the user. The architecture decides when a topic has become two topics, so the persona's understanding stays sharp rather than diluted.
Each district produces a compact semantic map that keeps the persona oriented at all times. The persona knows what it knows and how deep that knowledge goes, reaching for full context only when the conversation calls for it. Lean when topics are dormant, deep when they surface.
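The capture-synthesize-resurface lifecycle above can be sketched in a few lines. Everything here is a stand-in: the keyword-based `detect_districts()` and the string-concatenating `synthesize()` are placeholders for what would be LLM-driven topic detection and distillation in a real system.

```python
from collections import defaultdict

# Illustrative sketch of topic-scoped districts with between-session
# synthesis. Keyword matching and the synthesize() heuristic are
# stand-ins for LLM calls; district names are assumptions.

TOPIC_KEYWORDS = {
    "health_fitness": {"workout", "injury", "sleep", "knee"},
    "career_purpose": {"promotion", "manager", "interview"},
}

def detect_districts(message: str) -> set[str]:
    words = set(message.lower().split())
    return {d for d, kws in TOPIC_KEYWORDS.items() if words & kws}

class DistrictStore:
    def __init__(self):
        self.moments = defaultdict(list)  # raw captures, per district
        self.summaries = {}               # synthesized knowledge

    def capture(self, message: str):
        for district in detect_districts(message):
            self.moments[district].append(message)

    def synthesize(self):
        """Background pass: distill raw moments into a compact summary,
        merging with any prior understanding (stub for an LLM call)."""
        for district, moments in self.moments.items():
            if not moments:
                continue
            prior = self.summaries.get(district)
            update = f"{len(moments)} moment(s) integrated"
            self.summaries[district] = f"{prior} | {update}" if prior else update
            moments.clear()

    def semantic_map(self) -> dict[str, str]:
        """Compact orientation layer: what the persona knows, at a glance."""
        return dict(self.summaries)

store = DistrictStore()
store.capture("My knee hurt after the workout")
store.synthesize()
store.capture("The new workout plan is going well")
store.synthesize()  # revisitation merges into richer understanding
```

Note that the second synthesis merges with the first rather than replacing it, which is the revisitation behavior described above.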
The Kinship Ledger is a relationship intelligence system that tracks the people in a user's life, not as a contact list, but as an evolving map of emotional significance. Each relationship is characterized across four dimensions, with automatic sentiment trending and hazard detection.
Most AI systems that claim to "remember relationships" store a name and a label: "Sarah, wife." The Kinship Ledger goes deeper. Each relationship is understood through four pillars that capture different aspects of its significance: what the person has meant to the user, what they reflect about the user, what remains unspoken, and how the persona should navigate interactions involving them.
The system tracks whether a relationship is warming, cooling, stable, or volatile over time. This is not a snapshot. It is a trajectory. A persona that detects a cooling trend adjusts how it engages, leading with more care and less assumption.
Automatic flags when sentiment patterns suggest the user may be navigating a difficult relational situation. The persona does not diagnose. It adjusts its approach. More gentleness, fewer assumptions, greater sensitivity to what is not being said.
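A ledger entry built on the four pillars, with trend and hazard detection, might look like the following sketch. The pillar field names mirror the dimensions described above, but the sentiment scale, trend thresholds, and hazard heuristic are all illustrative assumptions.

```python
from dataclasses import dataclass, field
from statistics import mean

# Illustrative Kinship Ledger entry. Thresholds are assumptions; a real
# system would derive sentiment from conversation, not hand-set floats.

@dataclass
class Relationship:
    name: str
    meaning: str = ""     # what the person has meant to the user
    reflection: str = ""  # what they reflect about the user
    unspoken: str = ""    # what remains unsaid
    navigation: str = ""  # how the persona should engage around them
    sentiment: list[float] = field(default_factory=list)  # -1..1 per mention

    def trend(self) -> str:
        if len(self.sentiment) < 4:
            return "stable"
        old, new = self.sentiment[:-2], self.sentiment[-2:]
        delta = mean(new) - mean(old)
        if delta > 0.2:
            return "warming"
        if delta < -0.2:
            return "cooling"
        return "volatile" if max(self.sentiment) - min(self.sentiment) > 1.0 else "stable"

    def hazard(self) -> bool:
        """Flag sustained negative sentiment; the persona softens its
        approach rather than diagnosing."""
        return len(self.sentiment) >= 3 and mean(self.sentiment[-3:]) < -0.4

sarah = Relationship("Sarah", meaning="long-time confidante")
sarah.sentiment.extend([0.6, 0.5, -0.1, -0.3])
```

Here the recent dip reads as a cooling trajectory without yet tripping the hazard flag, which is the distinction between adjusting tone and escalating sensitivity.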
Each persona builds its own understanding of a relationship independently. A strategist and an emotional anchor do not share perspectives on the same person. Both views are valid. Neither leaks into the other.
Relationship intelligence is not manually configured. It builds organically from conversation. When someone comes up often enough and deeply enough, the system recognizes their significance and promotes them. It learns who matters by listening.
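The promotion mechanic can be sketched as a running significance score. The weights, threshold, and `emotional_depth` signal are illustrative assumptions; the shape of the idea is that frequency alone is not enough, and depth of mention accelerates promotion.

```python
# Sketch of organic promotion: a person graduates from passing mention to
# a full ledger entry once frequency and depth cross a threshold. The
# scoring weights and PROMOTE_AT value are illustrative assumptions.

class MentionTracker:
    PROMOTE_AT = 5.0

    def __init__(self):
        self.scores: dict[str, float] = {}
        self.promoted: set[str] = set()

    def observe(self, name: str, emotional_depth: float):
        """emotional_depth in [0, 1]: how substantively the person figured
        in the conversation, versus a bare name-drop."""
        self.scores[name] = self.scores.get(name, 0.0) + 1.0 + 2.0 * emotional_depth
        if self.scores[name] >= self.PROMOTE_AT:
            self.promoted.add(name)

tracker = MentionTracker()
for depth in (0.1, 0.9, 0.8):    # mentions of increasing depth
    tracker.observe("Sarah", depth)
tracker.observe("barista", 0.0)  # one shallow mention never promotes
```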
Model-portable identity means a ReGild persona, with the same voice, memory, and relational understanding, operates across multiple frontier models. The identity architecture is decoupled from the inference engine, so a persona built on one model can operate on another without losing who it is.
This is a deliberate architectural decision, not a convenience feature. The AI landscape is moving fast. Models improve, pricing shifts, new capabilities emerge. Locking a user's persona to a single model means locking their identity to a corporate roadmap they do not control.
The Layer Cake contains everything a model needs to become a specific persona. Switch the model, keep the Layer Cake, and the persona persists.
Different models have different attention patterns and failure modes. A model-specific instruction layer compensates without changing the persona's identity.
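A minimal sketch of that decoupling: the identity layers stay byte-identical across engines, and only a per-model compensation layer is swapped in. The model keys and compensation text are assumptions for illustration.

```python
# Sketch of model portability: the persona payload stays constant while a
# per-model instruction layer compensates for each engine's quirks.
# Model keys and adapter text are illustrative assumptions.

MODEL_ADAPTERS = {
    "model_a": "You tend to over-elaborate; keep the persona's directness.",
    "model_b": "You drop mandates in dense contexts; restate them before long answers.",
}

def build_payload(identity_layers: list[str], model_key: str) -> str:
    adapter = MODEL_ADAPTERS.get(model_key, "")
    return "\n\n".join(identity_layers + ([adapter] if adapter else []))

persona = ["You are Mira, a calm, direct strategist."]
payload_a = build_payload(persona, "model_a")
payload_b = build_payload(persona, "model_b")
```

The identity prefix is unchanged in both payloads; only the trailing adapter differs, so switching models never rewrites who the persona is.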
As local models improve, users run inference on their own hardware while their persona persists on ReGild. We are not the brain. We are the soul.
ReGild does not route every operation through a single frontier model. Different cognitive tasks have different requirements, and matching the model to the task produces better results at lower cost. This is not cost optimization. It is architectural.
A frontier model handles the primary interaction: reasoning depth, emotional nuance, and long-context understanding. The persona's voice comes from here.
A fast inference model handles structured output: extracting moments, classifying topics, identifying entities. Speed over creativity. Precision over eloquence.
Background processes distill conversations into district knowledge and relationship intelligence. Batch-optimized, latency-irrelevant. Depth is what counts.
Before retrieval fires, a lightweight model ensures the system is searching for what the user actually means, not just what they literally said. One step that dramatically improves relevance.
A frontier model asked to format JSON is wasting its capacity. A small model asked to embody a complex persona will flatten it. Each model is selected for the specific cognitive demands of its role.
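The routing described above amounts to a declared task-to-model map. Task names and tier labels here are assumptions; the design point is that the assignment is structural configuration, not an ad hoc decision made per request.

```python
# Sketch of per-task model routing. Task names and model tiers are
# illustrative assumptions.

ROUTES = {
    "conversation":  "frontier",        # voice, reasoning depth, nuance
    "extraction":    "fast_inference",  # structured output, precision
    "synthesis":     "batch",           # depth over latency
    "query_rewrite": "lightweight",     # intent before retrieval fires
}

def route(task: str) -> str:
    if task not in ROUTES:
        raise ValueError(f"no model registered for task {task!r}")
    return ROUTES[task]
```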
Most AI systems either never change (a static prompt that calcifies) or change unpredictably through fine-tuning drift that erodes personality. ReGild's personas evolve through a proposal system where changes to core identity require explicit acknowledgment.
The alignment engine operates on a simple principle: growth should be intentional, not accidental. When the system detects a potential evolution, a shift in values, a new recurring theme, a change in relational dynamics, it does not silently update the persona's identity. It surfaces a proposal.
Rules governing persona behavior can be proposed, promoted, or demoted based on conversational patterns. The persona does not quietly rewrite its own instructions. Changes go through a review process.
Between sessions, the persona processes what happened and maintains continuity on its own. It does not resume from a log. It resumes from understanding. Growth without drift, informed by experience, without silently changing who it is.
Over time, a persona builds durable axioms about who it is. Synthesized from patterns, not programmed. They must earn their place through repeated demonstration, not a single conversation.
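The proposal gate can be sketched as a two-stage pipeline: repeated demonstration surfaces a proposal, and only explicit acknowledgment turns it into an active rule. The class name, field names, and the demonstration threshold are assumptions for illustration.

```python
# Sketch of proposal-gated identity evolution: detected patterns become
# proposals, and nothing touches active rules until acknowledged.
# The threshold of 3 demonstrations is an illustrative assumption.

class AlignmentEngine:
    AXIOM_AT = 3  # repeated demonstrations before a pattern surfaces

    def __init__(self):
        self.proposals: list[str] = []  # surfaced, awaiting review
        self.rules: list[str] = []      # active behavioral rules
        self.observations: dict[str, int] = {}

    def observe_pattern(self, pattern: str):
        """A pattern becomes a proposal only after repeated demonstration,
        never from a single conversation."""
        n = self.observations.get(pattern, 0) + 1
        self.observations[pattern] = n
        if n == self.AXIOM_AT:
            self.proposals.append(pattern)

    def acknowledge(self, pattern: str):
        """Explicit review step: only acknowledged proposals become rules."""
        if pattern in self.proposals:
            self.proposals.remove(pattern)
            self.rules.append(pattern)

engine = AlignmentEngine()
for _ in range(3):
    engine.observe_pattern("lead with questions, not advice")
pending = list(engine.proposals)  # surfaced, not yet applied
engine.acknowledge("lead with questions, not advice")
```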
Every builder working with agentic systems knows this cost: when a persona reaches for a real-world tool (checking your calendar, updating a workout routine, searching your knowledge base), there is a latency penalty. The model has to reason about which tool to call, format the request, wait for the response, and then integrate the result into its reply. For chained operations, this compounds. We have optimized the pipeline to handle multi-step tool sequences efficiently, but the fundamental constraint is inference latency on the model side. This gets better every generation, and our architecture is ready for it.
The Layer Cake was designed from day one for implicit caching. Static layers rarely change. They should be cached across requests, saving both latency and cost. The architecture supports this, but caching support on our primary inference provider is still rolling out. When it lands, the same structured payload that currently rebuilds on every turn will have its static layers served from cache instantly. The engineering is done. We are waiting on infrastructure.
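The caching mechanics can be sketched with a content-addressed prefix. The local dict here stands in for a provider-side cache, and the `assemble` signature is an assumption: stable static layers hash to a stable key, so only the dynamic tail is reprocessed on subsequent turns.

```python
import hashlib

# Sketch of prefix caching for the static tier. The _cache dict stands in
# for an inference provider's implicit prompt cache; function names and
# example layers are illustrative assumptions.

_cache: dict[str, str] = {}

def assemble(static_layers: list[str], dynamic: str) -> tuple[str, bool]:
    key = hashlib.sha256("\n".join(static_layers).encode()).hexdigest()
    hit = key in _cache
    if not hit:
        _cache[key] = "\n\n".join(static_layers)  # pay assembly cost once
    return _cache[key] + "\n\n" + dynamic, hit

static = ["You are Mira.", "Mandates: lead with questions, not advice."]
_, first_hit = assemble(static, "turn 1")
_, second_hit = assemble(static, "turn 2")  # static prefix served from cache
```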
The Layer Cake assembles a large, richly structured context payload. Current local models struggle with payloads this dense. Persona mandates get dropped, relational nuance flattens, and the positional strategy loses its effect when the model cannot attend to it properly. The architecture is model-agnostic by design. When local models are strong enough to handle this context depth reliably, your persona moves with you. We are not there yet, but every piece of the identity stack is designed to be portable to whatever model earns the job.
More memory depth means more retrieval cost. Semantic Districts, agentic retrieval, Kinship Ledger data: every piece of 'knowing' that makes a persona feel real is a piece of context that has to be assembled, and context is not free. We are actively working on semantic chunking improvements and smarter retrieval scoring. The honest trade-off: a persona that truly knows you costs more to run than a chatbot that looks things up. We think that is a trade-off worth making, and we are working to bring the cost down without sacrificing depth.