Issue #002 — The Dunning-Kruger Effect In The AI Era

The Behaviour Brief

This Week's Psychology Finding

The People Most Likely To Trust Bad AI Output Are The Ones Who've Used AI The Longest — Up To A Point

A 2024 paper from MIT Sloan turned our intuition about AI competence upside down. It wasn't beginners who accepted the most flawed AI outputs. It was intermediate users — people with three to twelve months of daily AI experience. Here's the cognitive science of why.

Research Weight

Actionability

The study, conducted across 340 knowledge workers at firms in the US, UK, and India, measured something specific: the rate at which people accepted AI-generated outputs that contained a deliberate factual error embedded in an otherwise correct and well-formatted response. The error was subtle — a misattributed citation, a reversed causation in a data summary, an incorrect year for a well-known event. The kind of thing a confident, busy professional would miss.

The results broke down exactly as the Dunning-Kruger Effect would predict. Beginners, paradoxically, caught more errors — because they were reading carefully, unsure of the tool. Intermediate users missed the most errors by far. They had enough familiarity to trust the format and speed, but not enough depth to know where the model was likely to fail. Expert users — researchers, engineers who had stress-tested AI tools professionally — caught errors at nearly the same rate as beginners, but for a completely different reason: earned scepticism.

63%

Of Intermediate Users Accepted The Flawed Output Without Scrutiny

41%

Of Beginners Accepted It — More Careful Because Less Confident

38%

Of Expert Users Accepted It — But For A Different Reason: Over-Correction

This is the Dunning-Kruger curve playing out in real-time on the most consequential tool most professionals use daily. The people most likely to make a decision based on a wrong AI output are not the uninitiated — they're the confident intermediates. Three months in. Fluent enough to use it fast. Not deep enough to know when to slow down.

The Practical Takeaway: If you've been using AI tools for between two and twelve months, you are statistically in the highest-risk group for accepting flawed AI outputs. This is not an insult — it's the predictable shape of every new competence curve. The awareness alone is a partial corrective.

Key Insight From This Issue

The most dangerous person in any AI-augmented team is not the person who doesn't use AI. It's the person who uses it daily but has never experienced it being wrong in a way that mattered.

The Mind × Machine Deep-Dive

Main Feature · One Concept. Fully Explored.

The Four Phases Of The AI Confidence Curve — And The Specific Failure Mode At Each One

In 1999, David Dunning and Justin Kruger published the paper that changed how psychologists think about competence. Their curve has four distinct phases. Here is each phase mapped precisely to how AI users actually behave — with the failure mode at each stage and the escape route.

Research Weight

Actionability

Hover over the curve to locate each phase. Tap a dot to see the AI failure mode.

The original Dunning-Kruger paper (Journal of Personality and Social Psychology, 1999, doi:10.1037/0022-3514.77.6.1121) established a counterintuitive result: incompetence not only causes poor performance, it causes people to be unable to recognise their own poor performance. The mechanism is metacognitive — you need a degree of expertise before you can accurately evaluate your own skill level. Applied to AI, this means the gap between how good you think you are at using it and how good you actually are is largest at a very specific point on the learning curve: just after early success.

Phase One · The Peak

"Mount Stupid" — Maximum Confidence, Minimum Knowledge

You've used ChatGPT or Claude for two to eight weeks. It's answered twenty questions brilliantly. You feel like you've unlocked something. You start using it for high-stakes decisions. You don't verify outputs because your sample of correct answers was too small and too easy. The model's failure modes — hallucination, confidently wrong reasoning, out-of-date training data — haven't surfaced yet because you haven't asked hard enough questions.

⚡ AI Failure Mode: Uncritical Acceptance

Phase Two · The Valley

The Valley of Despair — The First Consequential Failure

Something goes wrong in a way that matters. A hallucinated citation in a client presentation. A wrong calculation in a financial model. A drafted email that misrepresents a position. Confidence collapses. Many people overcorrect here — they stop trusting AI at all, or they add so many verification layers that the efficiency gain disappears. Both responses are miscalibrations. The failure wasn't that AI is useless; it was that you were using it without understanding its specific failure modes.

⚡ AI Failure Mode: Overcorrection / Abandonment

Phase Three · The Slope

The Slope of Enlightenment — Calibrated Scepticism

You start building a mental map of when to trust and when to verify. You know Claude is unreliable for recent events. You know GPT-4o will confidently assert incorrect statistics when it doesn't have clean data. You know any model can be led astray by a poorly structured prompt. Confidence returns — but now it's proportional to actual competence. You check the things that need checking. You trust the outputs in the domains where the model is strong. This is the target state.

⚡ AI Failure Mode: Impatience — wanting to skip back to Mount Stupid

Phase Four · The Plateau

The Plateau of Productivity — Sustainable Partnership

AI becomes a genuine cognitive tool — not a crutch and not an oracle. You use it to amplify your own thinking: for breadth research before you go deep yourself, for challenging your reasoning once you've formed a view, for drafting after you've outlined the argument. You are the senior partner in the relationship. AI is the fast, capable, occasionally unreliable junior analyst who needs supervision on the things that matter most.

⚡ AI Failure Mode: Complacency — the plateau can tip into Mount Stupid 2.0

"The danger isn't AI getting smarter. It's you getting more confident without getting more capable. The curve doesn't care about your intentions — only your calibration." — Issue #002 · The Mind Machine · Pratik Kamble

Where Are You Right Now?

The most useful thing you can do after reading this section is locate yourself on the curve honestly. A few diagnostic questions: When did you last catch an AI output being wrong in a way you almost missed? If it was more than two weeks ago, you may be in Phase 1 or in a comfortable Phase 4 that's drifting back toward Phase 1. How often do you verify the AI outputs you use for high-stakes work? If the answer is "not consistently," Phase 1 or late Phase 2. Do you have a clear mental model of what GPT-4 or Claude will get wrong — not in general terms, but specifically in your domain?

The people who answer these questions confidently and correctly are in Phase 3 or Phase 4. The people who answer them with certainty and without nuance are still on Mount Stupid. The test of being off the peak is not knowing that you're right — it's knowing specifically where you might be wrong.

The Research Behind This

A 2025 study from Stanford HAI (AI User Competence and Miscalibration, March 2025) tracked 280 professionals across six months of AI tool use, measuring both self-assessed and objectively tested AI proficiency. The correlation between self-assessment and actual performance was negative at weeks 4–12 of use: people got worse at knowing how good they were as they got more familiar. The curve inflects around month four, where self-assessment begins to track actual performance again. The paper's conclusion: the period between one and four months of regular AI use is the highest-risk window for overconfident AI-assisted decisions.

"AI is here to assist your thinking — not replace it.
A tool only becomes powerful in the hands that understand its limits."

The Cognitive Toolkit

3 Calibration Tools Reviewed Through A Psychology Lens

The Tools Built To Make You A Better AI User. What Your Brain Actually Does With Them.

Notebooklm (Google)

AI Research Grounding Tool

Score

Psychology Verdict: NotebookLM forces the model to reason only from documents you provide — which eliminates the hallucination risk on factual recall entirely within that context. Cognitively, this shifts you from Phase 1 (passive trust) to Phase 3 behaviour in a single constraint. You know exactly what the model knows. The downside: it creates a false sense of total safety — the model can still misinterpret the documents you gave it. Best for: research synthesis, document-heavy work, verifying claims against primary sources.

Perplexity Pro

AI Search With Citations

Score

Psychology Verdict: Perplexity's killer feature isn't AI search — it's cited sources that you can actually click. This forces a verification habit that plain ChatGPT doesn't build. The psychology here is significant: source visibility activates System 2 thinking (slow, deliberate) rather than the fast System 1 acceptance that standard AI chat enables. The risk: citation presence creates a credibility halo even when the cited source doesn't actually support the claim. Always open the source. Best for: current events, factual lookups where recency matters, any work you'll present publicly.

Superwhisper + Otter.ai

Voice-to-AI Input Layer

Score

Psychology Verdict: Speaking your prompt rather than typing it sounds like a productivity trick, but its deeper effect is cognitive: verbalising forces you to articulate your thinking before AI responds. Research on verbal articulation shows it improves problem clarity and reduces the chance of outsourcing a question you haven't yet properly formed yourself. The catch: speaking freely also reduces the precision of your prompt, which leads to vaguer AI outputs. Best for: brainstorming, early-stage thinking, situations where you want to think alongside AI rather than delegate to it.

The Influence Move

🔒

⚡ Pro Subscribers Only

The Calibration Prompt — Make AI Tell You When It's Wrong

One specific prompt structure that forces the model to surface its own uncertainty before you commit to its output. Turns passive AI acceptance into active calibration in under 30 seconds. This week: The Epistemic Audit Prompt — the most effective single-sentence addition to any AI workflow.

⚡ Unlock With Pro — ₹799/mo See All Plans

Wired vs. Tired

What Moved The Needle This Week

Signal From Noise. What's Genuinely Important vs. Safely Ignorable.

⚡ Wired — Genuinely Important

AI Literacy Programs Being Rolled Into Corporate Onboarding

Three FTSE 100 companies announced this month that structured AI literacy training — not tool training, but cognitive literacy: knowing when to trust, when to verify, how to prompt for uncertainty — is now part of standard onboarding. This is the first major institutional signal that the industry understands Dunning-Kruger is a workforce risk, not just an individual curiosity. The companies that deploy AI best in 2026–2027 will be the ones that train calibration, not just capability.

💤 Tired — Safely Ignore

The "AI Will Replace X Profession" Content Cycle

Every week brings a new round of "AI will replace lawyers/designers/analysts/doctors by 2027." The research doesn't support the timeline, and the framing is wrong anyway. The more precise question is: which specific cognitive tasks within those professions will be partially automated, and how does that change what the remaining human role looks like? The replacement narrative generates engagement and changes almost nothing useful about how you should think about your career. Skip the headlines. Read the task-level research instead.

● Out Now · May 14, 2026

Issue #003

Attention Residue & AI

Why tab-switching between AI tools is destroying your deep focus — and what Gloria Mark's research says about the cognitive cost you're paying every time.

Attention Science Deep Focus Cognitive Cost

Drops in

Days

Hrs

Min

Sec

Read Issue #003 →

Back to The Mind Machine

You Think You Know
How To Use AI.
You Don't. The Dunning-Kruger Effect, the AI confidence curve, and why the most dangerous user in the room is the one who's used ChatGPT for three months

The People Most Likely To Trust Bad AI Output Are The Ones Who've Used AI The Longest — Up To A Point

The Four Phases Of The AI Confidence Curve — And The Specific Failure Mode At Each One

Where Are You Right Now?

The Research Behind This

The Tools Built To Make You A Better AI User. What Your Brain Actually Does With Them.

Signal From Noise. What's Genuinely Important vs. Safely Ignorable.

Never Miss An Issue

✦ You're in.