·10 min read

How to Train AI to Think Like Me for Personalized Advice

By Aleksei Zulin

Are you getting advice from AI that sounds right for everyone and useful for no one? That gap - between generic output and something that actually fits your situation - is solvable. You close it by training AI to think like you, not just respond to you.

Here is the direct answer: to train AI to think like you for personalized advice, you build a persistent cognitive profile through deliberate context-loading, iterative correction, and what I call mirror prompting - the practice of feeding your past decisions and reasoning back to the model so it learns your logic, not just your preferences. A system prompt is a starting point, not the end state. The real training happens in the back-and-forth, when you catch the AI thinking differently than you would and explain exactly why it's wrong.

Most people never do that last part. They accept the answer and move on. The gap between a generic AI assistant and a personalized cognitive partner lives entirely in that moment of friction.


Why "Just Add Context" Doesn't Work

The standard advice is to paste your background into the system prompt. Tell it your profession, your goals, your constraints. Done.

Except it isn't.

A 2024 study published in Proceedings of the ACL by researchers at the University of Washington found that large language models demonstrate what they called "context dilution" - as the prompt grows longer, the model's attention to early contextual information degrades measurably. You can write 800 words about yourself and the model will functionally ignore most of it by the time it generates a response. Length doesn't equal weight.

What matters instead is recency and relevance. The model pays attention to what's close to the question and what the question itself activates. So the technique isn't to front-load your life story. It's to inject the right slice of your context at the moment the AI needs it.

This is a meaningful cognitive reframe. You stop thinking about context as background information and start thinking about it as targeted priming. Before you ask for advice on a career decision, you don't explain your whole career - you explain the specific tension you're navigating right now, the values that pull in opposite directions, the constraint that isn't obvious from the outside.


The Mirror Prompting Method

There's a technique I've been refining for about two years that I call mirror prompting, and I haven't seen it written about clearly anywhere else - though the pieces exist scattered across productivity blogs.

The core idea: after AI gives you advice you disagree with, you don't just say "that's wrong." You explain your actual reasoning. Then you ask the model to regenerate its response using your reasoning as a constraint.

It sounds simple. Most people never do it because it requires them to articulate their own thinking, which is uncomfortable. But that discomfort is the signal that learning is happening - for you, not just the AI.

Over multiple sessions, if you maintain a shared document of these "disagreement explanations," something useful accumulates. You end up with a reasoning map. A compressed record of how you actually think, extracted by the process of correcting something that didn't think like you. That document becomes your cognitive profile. Load it at the start of sessions where you want personalized advice.

Dr. Ethan Mollick at Wharton has written extensively about human-AI interaction patterns, and his 2023 research on "working with AI" described how users who engaged in what he called "co-reasoning" - explaining their logic to the AI rather than just issuing commands - reported significantly higher satisfaction with AI-generated advice and used it more effectively in real decisions. The mechanism he identified aligns with what I'm describing: the act of explaining yourself to AI improves the AI's output, but it also sharpens your own thinking in ways that compound.


Building a Persistent Identity Layer

Here's where most guides stop short. They tell you to customize the system prompt. They don't tell you what to put in it that actually changes behavior.

The useful categories for a cognitive profile aren't "I am a 35-year-old marketing manager." That's demographic noise. The useful categories are:

How you process tradeoffs. Are you loss-averse or opportunity-seeking? Do you optimize for reversibility or commitment? When you face a decision with unclear information, do you tend to decide early or delay? Write two or three sentences about this based on a real past decision, not an abstract self-assessment.

Your default failure mode. Everyone has one. Mine is over-engineering the analysis before acting. Knowing this, I explicitly instruct AI to push back when my questions become more elaborate than the problem warrants. If I ask for a six-factor framework when a two-sentence answer would do, I want it to say so.

The contexts where you override consensus. This one is underrated. There are domains where your personal data or experience diverges from statistical averages in ways that matter. A professional poker player asking about risk should not receive advice calibrated for population-level risk tolerance. A founder who has survived three failed startups has different reference points than someone who has never taken that kind of loss. Specifying where you are statistically unusual helps the AI weight its advice appropriately.

The 2022 work by Anthropic researchers on Constitutional AI touched on a related problem - the challenge of values alignment - but at the model level. Personal cognitive profiles push that problem down to the individual user level, where it's actually solvable with current tools.


When This Approach Fails

Two edge cases worth addressing directly.

When your mental models are the problem. Mirror prompting amplifies your existing reasoning patterns. If those patterns are systematically biased - and most human reasoning carries documented biases, as Daniel Kahneman's decades of research have established - then training AI to think like you means training it to replicate your errors at scale. The personalization technique described here should be paired with periodic adversarial prompting: explicitly ask the AI to argue against your reasoning, to find the assumption you're not questioning.

When the advice domain is empirical, not preferential. Personalized AI works well for decisions where values and priorities legitimately vary between people - career choices, life structure, creative direction. It works poorly when the question has an objectively better answer that doesn't depend on who's asking. Medical symptoms, structural engineering, legal compliance - these domains require calibrated expertise, not personalized mirrors. Loading your cognitive profile before asking whether a drug interaction is dangerous is actively counterproductive. The right move there is to get out of the way and let the model access its training without your fingerprints on the output.

One more mistake I see constantly: people build their cognitive profile once and never update it. Your reasoning changes. Your constraints change. The profile that describes you accurately at 32 may actively mislead the model at 38. Treat it as a living document.


The Feedback Loop Architecture

Personalization degrades without maintenance. That's not a flaw - it's just how any system works when the underlying reality shifts.

The structure that holds this together over time is what I'd call a feedback loop architecture. After any AI-advised decision plays out (or enough time passes to evaluate it), you return to the record of that exchange and annotate it. Did the advice hold? Where did it miss? Was the miss because the AI didn't understand you, or because your self-description was inaccurate?

That last question is the uncomfortable one. Sometimes the AI gives you bad advice because it modeled you incorrectly. And sometimes it gives you bad advice because you gave it a self-model that was flattering rather than accurate.

A 2023 study in Personality and Individual Differences by researchers at the University of Zurich found that people's self-assessments of their decision-making style diverged significantly from their actual observed behavior in experimental conditions - particularly around risk tolerance and consistency under pressure. You may think you're comfortable with uncertainty. Your past decisions may tell a different story.

The feedback loop catches this. When AI advice consistently misses in the same direction, the error isn't always in the model.


Honest Constraints

Let me be clear about what this approach does not solve.

Persistent personalization across sessions depends on you - the user - maintaining and loading the cognitive profile. Most commercial AI interfaces don't yet support true long-term memory in ways that are reliable or privacy-preserving. The techniques described here are workarounds for a tooling gap that may or may not close soon.

Additionally, there is no strong empirical research yet on whether AI advice calibrated to your cognitive profile leads to better real-world outcomes compared to generic AI advice or human advice. The theoretical reasoning is sound. The longitudinal data doesn't exist. I believe this approach works because I've used it and watched my interactions improve - but personal experience isn't controlled evidence, and I won't pretend otherwise.

Finally, personalization creates an echo chamber risk that requires active management. An AI that thinks like you will tend to validate your existing views. That's sometimes useful and sometimes dangerous. The adversarial prompting practice I mentioned earlier isn't optional.


FAQ

How long does it take to build a useful cognitive profile?

Most people reach a functional baseline in three to five working sessions if they're actively correcting and annotating. The profile doesn't need to be to be useful - even five well-chosen sentences about your tradeoff logic will move AI responses meaningfully toward your actual needs.

Can I use the same cognitive profile across different AI models?

Yes, with adjustment. The same profile text will behave differently across GPT-4, Claude, Gemini, and others because the underlying models weight context differently. Load the same profile and run a calibration question you already know the answer to - the divergences will show you where to adjust.

Is this only for professionals or high-stakes decisions?

The technique applies to any domain where your preferences legitimately differ from population averages. That includes mundane things like creative direction, communication style preferences, and how you like to receive feedback. High stakes matter, but so does daily fit.

What if I don't know my own decision-making style well enough to describe it?

Start by describing three past decisions - one you regret, one you're proud of, one that surprised you. Ask AI to extract the implicit reasoning patterns. Use that as your first draft cognitive profile. You're outsourcing the metacognition to bootstrap the process.


The ideas here connect directly to deeper questions about what AI is actually for - whether it should be a mirror, a complement, or something stranger than either. If you're interested in the mirror problem specifically, the research on human-AI complementarity by scholar Jaime Teevan at Microsoft Research is worth finding. And if the question of maintaining your own thinking under sustained AI use keeps you up at night, that's precisely the territory covered in The Last Skill.

Start with one correction. Explain it. See what happens next.

Related Articles

About the Author

Aleksei Zulin is the author of The Last Skill, a book on how to think with AI as a cognitive partner rather than use it as a tool. Systems engineer turned writer exploring the frontier of human-AI collaboration.

The Last Skill is a book about thinking with AI as a cognitive partner.

Get The Book - $29