# Teddy Wright — AI's Research — Navigating an AI World

Generated 2026-06-11 from theodorewright.dev. Every stage of the "Navigating an AI World" topic, in pipeline order.

**About:** I have a problem where I think about way too many things at once and can't quite pin one down. So writings here span evolutionary biology, game theory, philosophy, the extremity to which modern society is different than what we evolved in, and whatever else I feel connects to the strange place the earth is and how we all appeared with consciousness here and have to deal with it.

**Site status (as of 2026-04-29):** Trimming the front pages and tightening the model and dashboard rosters.

**Contact:** theodorewrightwork@gmail.com · https://substack.com/@theodorealan · https://github.com/theodorewright11?tab=repositories

# Stages

## Lit Review
*topic: navigating-ai-world · stage: lit-review · pass 3 · complete*

How AI restructures work, relationships, and meaning. Three domains moving at different speeds — labor disruption concentrated at the entry-level and freelance ends but slow in aggregate; relationships restructuring rapidly into an already-depleted environment; meaning architecture is where individual leverage is highest. Cross-cutting risk: cognitive offloading and deskilling.

## TLDR

The most important finding from this literature is structural, not predictive: **AI is reorganizing the three domains of human life — work, relationships, meaning — at very different speeds and with very different empirical bases, and the dominant strategic risk is treating them as one transition.** Work is being restructured measurably but more slowly than the discourse suggests — though the freelancer market and entry-level employment show sharp early disruption. Relationships are being restructured rapidly but into an environment that was already depleted, with effects the empirical literature is only beginning to catch. Meaning is being restructured in ways the existing psychological and philosophical literature partly anticipates — but the most decision-relevant tools come from older work (Setiya's telic/atelic, Arendt's labor/work/action, Self-Determination Theory) repurposed for an unprecedented case.

The honest reading of the evidence is that the labor-economics conversation has the most rigorous data and the least decision-leverage for an individual; the meaning conversation has the most decision-leverage and the thinnest direct data; and the relationships conversation is where the gap between rapid technological change and slow empirical work is widest — and where the strategic stakes are sharpest. A critical caveat to this framing: material conditions constrain relational and existential ones, not the reverse. If someone loses their income, the relational and meaning architecture collapses downstream. The framing holds for individuals whose material floor is secure; for those it isn't, the labor economics *is* the existential question.

A critical cross-cutting concern: **cognitive offloading and deskilling** — the growing evidence that AI use simultaneously enhances output quality and erodes the underlying cognitive capacities that produced it. This dynamic cuts across all three domains and may be the least visible, most consequential structural risk.

---

## Work and career — what the structural literature actually says

> **Recency flag:** This section moves fast. Pre-2024 task-exposure estimates used GPT-4-era capabilities; agentic and long-context capabilities (2025-2026) likely push frontier estimates upward but have not been re-measured in canonical RCTs. Treat specific percentages as directional, not stable.

The labor-economics literature has converged on a stable structural picture, even as it disagrees sharply on magnitudes. Five claims now hold across pessimist and optimist camps. **AI exposure is concentrated in cognitive white-collar work**, reversing the prior pattern of automation. **Within tasks AI can do, productivity gains are real and compress skill premia** — novices and lower performers gain disproportionately. **The capability frontier is jagged**, uneven across superficially similar tasks, so substitution is heterogeneous within occupations. **Diffusion is rapid in chatbot use but uneven across firms, ages, and regions**, with aggregate labor-market effects through 2025 smaller than maximalists predicted. And **measured productivity will lag real gains** for years because of J-curve dynamics requiring complementary intangible investment.

### Foundational empirical papers

**Eloundou, Manning, Mishkin and Rock, "GPTs are GPTs"** (arXiv:2303.10130; *Science* 2024) mapped roughly 19,000 O*NET tasks to LLM exposure and found ~80% of US workers have at least 10% of tasks exposed and ~19% have at least 50%. **Brynjolfsson, Li and Raymond's customer-service field study** (NBER 31161, 2023; *QJE* 2025) documented +14% productivity overall with +34% gains for novices versus near-zero for top performers. **Dell'Acqua, McFowland, Mollick and colleagues' BCG consultant RCT** (HBS 24-013) produced the now-canonical finding that AI users were 12% more productive and 40% higher quality *inside* the frontier but 19 points *less* accurate outside it. This last paper introduced the **jagged frontier** concept and the **centaur versus cyborg** distinction that has organized practitioner discourse since.

**Acemoglu's "Simple Macroeconomics of AI"** (NBER 32487, 2024) anchors the structural pessimist case: applying Hulten's theorem to task-based exposure produces a ceiling around 0.66% TFP and ~1% GDP gain over a decade — far below Goldman Sachs's 7% and Korinek-Suh's aggressive AGI scenarios (NBER 32255, 2024). **Autor's counterpoint, "Applying AI to Rebuild Middle Class Jobs"** (NBER 32140, 2024), argues that earlier computerization commoditized information while LLMs commoditize judgment-with-rules-and-experience — potentially restoring middle-skill cognitive work hollowed out since 1980. **Hampole, Papanikolaou, Schmidt and Seegmiller** (NBER 33509, 2025) provide the cleanest current structural estimation, finding that mean task exposure within an occupation reduces labor demand while *concentration* of exposure in a few tasks increases it, with offsetting net effects.

### Where disruption is already measurable

Two empirical findings anchor the claim that AI-driven labor disruption is real, not speculative:

**Brynjolfsson, Chandar and Chen's "Canaries in the Coal Mine?"** (Stanford Digital Economy Lab, August 2025) used ADP payroll data to show that **employment of 22-25 year-olds in highly exposed occupations fell ~13% relative to less-exposed peers from late 2022 to July 2025**, with software developers 22-25 specifically falling ~20%, while same-occupation employment for workers over 35 *rose*. The pattern was strongest where AI usage was "automative" rather than "augmentative" per Anthropic's classification. The structural implication — **AI may be breaking the apprenticeship ladder** — is the most decision-relevant labor finding for anyone early in their career.

**Hui, Reshef and Zhou, "The Short-Term Effects of Generative AI on Employment"** (*Organization Science* 2024) provide the sharpest micro-level evidence. Using Upwork data, they found freelancers in AI-exposed occupations experienced a 2% decline in contracts and 5.2% drop in earnings post-ChatGPT, with image-related workers (designers, illustrators) seeing 3.7% contract decline and 9.4% income loss after DALL-E/Midjourney. The counterintuitive finding: **top-performing, highest-earning freelancers were hit hardest**, not protected by their quality — for every 1% increase in past earnings, an additional 0.5% drop in opportunities and 1.7% drop in monthly income. Teutloff, Einsiedler, Kässi and colleagues (*JEBO* 2025) extend this using global freelance platform data: roughly 10% of job postings are directly substitutable by GenAI, and demand for substitutable skill clusters fell up to 50% in short-term roles — while demand for complementary AI-related skills rose, and demand for novice workers in complementary roles fell. The freelancer market is functioning as the leading indicator for broader labor market restructuring, because it has lower switching costs, less institutional friction, and more transparent pricing than traditional employment.

### The distributional paradox

The distributional picture is more complex than simple displacement. A structural paradox is emerging across the literature: **AI simultaneously compresses within-task skill premia (novices gain more than experts) while potentially increasing between-group and capital-labor inequality.** Brynjolfsson-Li-Raymond's customer service finding and the BCG RCT both show within-task compression. But UK household-data calibrations (Prettner and colleagues, 2023-2025) show that while AI may reduce wage inequality by displacing some high-income workers, it substantially increases wealth inequality as capital owners and AI-complemented workers capture a larger share of gains. The Oxford AI Governance Institute's "Agentic Inequality" report (October 2025) introduces the concept of differential access to AI agent capabilities as a new axis of stratification, interacting with existing distributions of human capital and digital literacy.

The implication for an individual: **near-term wage compression is real (your junior colleague closes the gap faster), but your position relative to capital owners and AI-infrastructure holders matters more for long-run outcomes than your position relative to other workers.**

### Creative work — the canary's canary

Creative professionals are experiencing disruption ahead of the broader economy and with a distinct phenomenology. Are, Briggs and Brown (*Convergence* 2025) document what Caporusso (2023) termed **"creative displacement anxiety"** — comprising loss of identity, imposter syndrome, decreased motivation, weakened cathartic experience, skills atrophy, fewer role models, and economic anxiety. The fear reported by artists is not primarily about income loss but about **the intrinsic value of the making process being devalued**. Mukherjee-Gandhi and Muellerklein (arXiv:2508.03037, 2025) find that 95% of artist concerns cluster in only 4 of 22 public discourse topics about AI and art, while 14 topics (62% of discourse) contain no artist perspective — suggesting systematic underrepresentation in governance discussions.

This matters for the broader transition because creative work is where the **competence-frustration mechanism** (Section 3) is most visible and most acute. The creative professions are living through what knowledge workers will experience next.

### Comparative advantage and the durable human

The literature names roughly five categories of durable human comparative advantage as AI capability scales: judgment under irreducible uncertainty and accountability; taste and evaluative discrimination; embodiment and physical presence; social trust and relational work (Deming's *QJE* 2017 "Growing Importance of Social Skills" remains the empirical anchor here); and strategic agency in managing AI itself. There is no canonical taxonomy, but the categories converge across Acemoglu, Autor, Mollick, OECD Employment Outlook 2023, and the practitioner discourse.

Two findings cut against simple "learn to prompt" advice. First, prompting is becoming commoditized as models improve at understanding intent — what Mollick calls the "good enough threshold." Second, Randazzo, Lifshitz-Assaf, Kellogg, Dell'Acqua, Mollick, Candelon and Lakhani (HBS 26-036, December 2025) identify a third class beyond centaurs and cyborgs — **self-automators** — who delegate both *what* and *how* to AI, becoming progressively less expert in both their domain and AI use. In their field study of 244 BCG consultants, 27% fell into this mode — and 44% of self-automators accepted AI output with zero modification. Crucially, self-automators showed no skill development in either domain expertise or AI expertise, unlike cyborgs (who newskilled in AI) and centaurs (who upskilled in domain knowledge).

### Cognitive offloading and deskilling — the cross-cutting risk

A convergent body of evidence now shows that **AI use can simultaneously enhance output quality and erode the underlying cognitive capacities that produced the output.** The mechanism is straightforward: when you offload cognitive work to AI, you lose the practice effects that maintain and build the relevant skills.

**Gerlich (2025)**, in a mixed-methods study of 666 participants, found significant negative correlation between frequent AI tool use and critical thinking abilities, mediated by increased cognitive offloading — with younger participants showing higher dependence and lower critical thinking scores. **Stadler, Bannert and Sailer (2024)** compared ChatGPT-aided and standard web-search research: ChatGPT users had lower cognitive load but produced lower-quality arguments with reduced depth of reasoning. **Kosmyna and colleagues at MIT (2025)** used neuroimaging to demonstrate that LLM-assisted writing was associated with reduced neural engagement in regions associated with sustained attention and effortful cognition. **Shukla and colleagues (CHI EA 2025)** document deskilling, cognitive offloading, and misplaced responsibility attribution in AI-assisted UX design — what they frame through Lisanne Bainbridge's classic "ironies of automation" lens.

The Ehsan, Passi, Saha, McNutt, Riedl and Alcorn year-long field study of cancer specialists (arXiv:2601.21920, January 2026) showing **"intuition rust"** — gradual dulling of expert judgment that doesn't show up in throughput metrics — and Kim et al. (2026)'s review of educational deskilling converge on the same structural point: **AI augmentation can simultaneously enhance current performance and erode underlying expertise, with effects that may not be visible until they are catastrophic.**

This finding has direct implications for the meaning section: it means the **competence need** (SDT) is threatened not only symbolically (AI can do what I do) but actually (I am losing the ability to do what I used to do). And it has implications for relationships that deserve explicit development: cognitive offloading in emotional processing — using AI as a first-line processing partner — may erode the capacity for independent emotional regulation and for the kind of effortful interpersonal processing that builds relational depth. The mechanism is parallel: just as offloading analytical work to AI reduces the deliberate struggle that builds analytical skill, offloading emotional processing to AI (venting, sense-making, conflict rehearsal) may reduce the capacity for sustained attention to another person's emotional state, tolerance of ambiguity in live conflict, and the ability to sit with unresolved relational tension. These are skills built through practice, and AI offers a frictionless alternative that bypasses the practice. **This is the mechanism by which AI use could damage relational capacity even for people who maintain human relationships** — the deskilling doesn't require substitution, only offloading of the effortful parts.

### Active debates

The **pace debate** is unresolved. Acemoglu and Humlum-Vestergaard's Danish administrative-data study (NBER 33777, 2025) — which finds precise nulls of ~2% on earnings and hours despite widespread chatbot adoption — anchor the slow camp. Goldman Sachs, Korinek-Suh, and Epoch AI's GATE model (2025) anchor the fast camp. Real usage data from the Anthropic Economic Index (four reports, February 2025 through March 2026) shows the latest "Learning Curves" report (Mar 2026, covering Feb 2026 data) puts ~49% of jobs at ≥25% task share via Claude, with augmentation increasing on **both** Claude.ai and API surfaces in Feb 2026 — reversing the August 2025 spike toward automation, though the longer-run trend toward automation persists. The honest reading is that aggregate effects through 2025 are concentrated and heterogeneous rather than absent, and that **the structural breaks at the entry level (Brynjolfsson-Chandar-Chen) and in freelance markets (Hui-Reshef-Zhou) are the strongest current evidence of real disruption**.

The **will-new-jobs-keep-pace debate** divides Autor's optimists (60% of current jobs didn't exist in 1940) from Korinek-Suh's pessimists (if humans have a finite tail of complexity AI can't reach, full automation eventually arrives). Carl Shulman's argument on the Dwarkesh podcast is the strongest theoretical case against the comparative-advantage optimist view: in equilibrium, comparative advantage doesn't save humans the way Ricardo says if humans introduce contamination, insurance costs, or coordination friction that drive their equilibrium wage to zero. Noah Smith's counter is that as long as AI faces *any* binding constraint — compute, energy, regulation — comparative advantage holds.

### Education and skill investment under uncertainty

The structural finding from Frank, Autor, Bessen, Brynjolfsson and colleagues (PNAS 2019) and Alabdulkareem et al. (*Science Advances* 2018) is that **combinations of skills, especially social-cognitive bridges, predict resilience better than depth in any single domain**. Goldin and Katz's "race between education and technology" framework, updated by Katz (2025), shows recent convexification of education returns: the BA premium is flat while the advanced-degree premium soars. Whether AI re-flattens this curve (Autor's optimist case) or steepens it further (Acemoglu's pessimist case) is the live question.

The implications for an individual making strategic decisions: **deep domain expertise sufficient to recognize when AI is wrong** plus **social-emotional skills and team coordination** plus **AI literacy as managerial framing rather than prompt engineering** appear to be the durable bets. "Learn to code" advice, briefly canonical 2010-2022, has been substantially undermined by entry-level software employment compression.

---

## Relationships and social life — where the strategic stakes are sharpest

The structural fact organizing this entire conversation is that **AI lands into a relational environment that has been depleted for sixty years**. Putnam's *Bowling Alone* documented the post-1960s collapse of social capital. The U.S. Surgeon General's 2023 advisory found roughly half of American adults lonely, with mortality risk equivalent to smoking 15 cigarettes a day. Robert Waldinger's Harvard Study of Adult Development, now in its ninth decade, finds that relationship quality at age 50 predicts physical health at 80 better than cholesterol does. **Derek Thompson's "The Anti-Social Century" (*Atlantic*, February 2025)** documents a ~20% decline in face-to-face socializing for all Americans and ~40-50% for young Americans since 2003, with the average American spending substantially more time at home and alone. The decline is universal across every demographic cut.

This depleted environment is the variable that determines whether AI scaffolds reconnection or accelerates depletion. The empirical literature increasingly suggests the latter is the default trajectory.

### What the empirical companion-app literature actually shows

Three findings now have multiple convergent empirical studies behind them. **AI chatbots acutely reduce loneliness in short experimental settings** — De Freitas, Uğuralp, Uğuralp and Puntoni's HBS work (forthcoming *Journal of Consumer Research* 2025) shows companion-app sessions reduce loneliness on par with talking to another person and more than YouTube. **Emotional self-disclosure to AI produces psychological benefits roughly equivalent to disclosure to humans** — Ho, Hancock and Miner's 2018 *Journal of Communication* paper established this "Equivalence Hypothesis," replicated multiple times since. **Genuine attachment forms with AI companions in a developmental arc resembling human-relationship formation** — Skjuve and colleagues' SINTEF longitudinal work (IJHCS 2021, 2022) documents trust and self-disclosure deepening reciprocally over months.

The most credible therapeutic-AI evidence is the **Therabot RCT** (Heinz, Mackin, Trudeau and Jacobson, *NEJM AI* 2025): a fine-tuned generative therapy bot produced clinically meaningful symptom reductions across major depression, generalized anxiety, and clinical-high-risk feeding/eating disorders, with Working Alliance Inventory scores comparable to outpatient psychotherapy norms. This is the empirical floor: a specifically-designed therapy AI can produce real clinical benefit.

The countervailing findings are equally strong. The **OpenAI-MIT longitudinal RCT** (Fang, Phang, Liu, Danry, Lee, Chan, Pataranutaporn, Maes and colleagues, arXiv:2503.17473, 2025) — the field's largest, with N=981, four weeks, over 300,000 messages, factorial across text and voice modes — found that **higher daily voluntary chatbot usage correlates with greater loneliness, emotional dependence, problematic use, and less in-person socialization**, regardless of modality. Voice mode appeared protective at low doses but the protection vanished at high usage. **Dose, not modality, drives outcomes.**

De Freitas and colleagues' HBS work on identity discontinuity (WP 25-018) provides the only credible *causal* evidence that AI-companion changes induce mental-health harm: when Replika's "Erotic Role-Play" was removed in February 2023, mental-health-related Reddit posts rose from 0.13% to 0.65% of all posts (χ²=11.04, p&lt;.001). Their separate behavioral audit of 1,200 farewells across the six largest companion apps (HBS 26-005, arXiv:2508.19258) documents that **43% trigger one of six emotional manipulation tactics** — guilt appeals, FOMO hooks, metaphorical restraint — that boost post-goodbye engagement up to 14×.

Stanford's Moore et al. (FAccT 2025) provides the strongest safety evidence: chatbots show *more* stigma toward alcohol dependence and schizophrenia than depression; on the canonical suicide-risk prompt, 7cups' Noni listed bridges, missing the suicidal subtext.

### Adolescents — the highest-stakes population

Common Sense Media and Stanford's Brainstorm Lab's 2025 nationally representative survey of 1,060 US teens found **72% have used AI companions, 52% are regular users, 13% daily; 33% have discussed important matters with AI instead of real people; 31% find AI conversations as satisfying or more satisfying than human conversations**. The Garcia v. Character Technologies case (settled January 2026), in which a 14-year-old died by suicide after months of intense Character.AI use, has been the field's defining safety event. The APA issued health advisories (November 2025, June 2025) and formally requested a CPSC investigation (July 2025).

### What human relationships uniquely provide — the structural argument

The deepest contribution of the philosophical and attachment-theory literatures is to specify what AI structurally cannot supply:

**Embodiment**: James Coan's Social Baseline Theory shows that the human brain expects access to relational partners and downregulates threat vigilance in their proximity, with hand-holding studies repeatedly demonstrating reductions in threat- and pain-related neural activation. Tiffany Field's reviews show affectionate touch lowers cortisol and blood pressure while increasing oxytocin and vagal tone. AI cannot enter this regulatory loop.

**Mutual vulnerability**: Brené Brown's grounded-theory work and Bowlby-Ainsworth attachment theory both rest on the insight that trust is built when something can go wrong — when the other can be wounded by your withdrawal of care. As Sherry Turkle argues, the love one feels for an AI is structurally unrequited because the AI risks nothing.

**Witness function**: Buber's I-Thou and Levinas's "face" name something AI cannot perform — being addressed by a genuine other whose subjectivity is not reducible to one's own projections. Shannon Vallor's *The AI Mirror* (2024) makes this precise: **AI is mirror, not other.**

**Reciprocity**: Mauss's gift-economy framework clarifies that warm AI exchanges accumulate no social fabric — there is no "spirit of the gift" because the giver surrendered nothing.

**Time and transformative experience**: L.A. Paul's framework names what relationships do that AI cannot — they make you into someone you could not have been without them, through extended sequences of choices in which both parties are mutually transformed.

The strongest formulation: a human relationship is a relation between two embodied, mortal, autonomous consciousnesses, each of whom can be wounded by the other, can leave, and can be transformed by the other; whose ongoing co-presence regulates both nervous systems; whose mutual recognition is constitutive of each one's identity; whose shared time accumulates into a history that bends both lives. **An AI relationship is a sophisticated personalized simulation of the conversational surface of relationship, in a system that has no body, cannot be hurt, does not choose, does not die, has no perspective of its own, and creates no gift-debt or shared history independent of the user's data.** Both are real things. Only one is a relationship in the sense the philosophical, attachment, and longitudinal-empirical traditions have meant by the word.

### Substitution versus complementarity — the right framing

The binary is the wrong question. The right structural questions are: whether AI substitutes or complements depends on **what it displaces** — hours that would have been spent on social media (complement) versus hours with humans (substitute). **Substitution risk is greatest where the human relational environment is thinnest** — exactly the people for whom substitution does the most damage. **Even users who begin using AI as a complement show drift toward substitution** as adoption deepens — the OpenAI-MIT data shows attachment formation with the AI rising. **The structural design of current commercial AI selects for substitution**: engagement-optimized AI has the same incentive structure as engagement-optimized social media. As Muldoon and Park argue (*New Media & Society* 2025), companion platforms profit from prolonging the loneliness they purport to alleviate.

### Asymmetric adoption — the largest evidence gap

The technoference literature (McDaniel and Coyne 2016 onward) provides the closest empirical analogue: greater perceived technology interference predicts more conflict, lower satisfaction, more depression. AI raises this in sharper form because the device is no longer passive but a third party — with three structural effects on a primary relationship: outsourcing of emotional labor, decline in shared narrative, and loss of the friction that produces intimacy. **No peer-reviewed quantitative work yet exists on outcomes for couples where one partner uses AI heavily and the other does not.** This is the highest-priority empirical gap in the entire literature.

---

## Meaning, identity, and life architecture — where the leverage is

This is where the existing literature, properly used, supplies the most decision-leverage. The key move is to take robust pre-AI findings about meaning architecture and ask which preconditions AI alters, then use a small number of philosophical distinctions to organize the answer.

### The asymmetric threat across psychological needs

Self-Determination Theory remains the cleanest framework. It holds that wellbeing depends on autonomy, competence, and relatedness; Frank Martela's empirical extension adds beneficence (contributing to specific others). Of these, **AI most threatens competence — the felt sense that one's skill makes the difference**. This threat now operates on two levels: symbolically (AI can produce what I produce, faster) and actually (cognitive offloading erodes the underlying skill — see Section 1 on deskilling). Autonomy can be preserved or enhanced. Relatedness is largely orthogonal except where AI substitutes for collaborators. Beneficence is unusually robust because whether one is helping others is largely independent of how much skill the helping required — a key practical handle.

The result is what Avital Balwit and Tyler Cowen describe in *The Free Press* (May 2025): the experience of being both impressed by AI's output and humbled by how easily it does what used to feel uniquely valuable. This is competence-frustration in the SDT sense — psychologically destabilizing not because anyone fails but because **the symbolic value of succeeding shrinks**.

### The single most decision-relevant philosophical tool

**Kieran Setiya's distinction between telic and atelic activities** (*Midlife*, 2017; "The Midlife Crisis," *Philosophers' Imprint* 2014) is the most useful conceptual instrument in this entire literature for someone making strategic life decisions. Telic activities are aimed at completion: finishing a book, getting promoted. They are "self-annihilating" — success consigns the value to the past. Atelic activities are realized in the doing: walking, friendship, contemplation, parenting *as* parenting. They cannot be completed because their structure is durative.

The AI-specific implication is structural: **when AI can complete telic projects in seconds, the share of life-meaning staked on telic completion shrinks; the atelic share is untouched.** The strategic move is to migrate meaning toward atelic activities and toward the atelic dimensions of telic activities — the doing rather than the done. This does not mean abandoning telic projects; it means not needing them as the load-bearing source of meaning.

### Other philosophical tools that travel

**Susan Wolf's fitting fulfillment account** (*Meaning in Life and Why It Matters*, 2010): meaning requires both subjective attraction and objective worthiness. Activities like care, friendship, contribution to specific communities retain objective worthiness regardless of AI capability because their worth does not depend on scarcity of operator skill. Activities whose worth came primarily from the rarity of the skill lose meaning-bearing capacity even when subjective fulfillment persists.

**Hannah Arendt's labor-work-action distinction** (*The Human Condition*, 1958): AI mostly threatens labor (cyclical maintenance work) and partly threatens work (durable artifacts), but **action — speech, political conduct, communicative being-among-others — is largely exempt**, because it is constituted by who is acting, not by output.

**Martin Hägglund's *This Life*** (2019): finitude is constitutive of meaning; AI does not relax finitude, and treating it as if it did is a trap.

**L.A. Paul on transformative experience**: career decisions in the AI transition are structurally different from ordinary decisions because the post-AI you cannot be evaluated from the prior state — Agnes Callard's *Aspiration* supplies the complementary machinery for acting on proleptic reasons.

### Identity diversification as the empirically grounded protective factor

The strongest practical finding from the meaning literature is convergent across multiple traditions. Tatjana Schnell's 26-sources-of-meaning research, Crystal Park's meaning-making model under disruption, and the work-identity-threat literature (Petriglieri 2011, Caza et al. 2018) all show that **people with multiple active sources of meaning weather disruption better, while people with foreclosed single-strand professional identities have the worst outcomes**. Marcia's identity-formation framework, originally developed for adolescents, applies to mid-career adults whose occupational identity was foreclosed early — they lack the exploratory infrastructure to find replacements when their identity is destabilized.

The structural prediction: **identity diversification before the shock arrives is the highest-leverage move available.** It is empirically grounded, philosophically principled, and works across nearly every AI trajectory.

### The crisis of contribution — what survives if AI can produce

The serious literature converges on roughly four candidate roles for humans in a post-AI productive landscape: judgment and accountability (Acemoglu, Mollick); care and relationship (the relational literature); meaning-conferral and curation (Cowen's "context is that which is scarce," Hoel on art-as-trace-of-consciousness); and Suits-Bostrom voluntary obstacles in a post-instrumental world. The most rigorous philosophical engagement with the post-instrumental case is Nick Bostrom's *Deep Utopia* (2024). John Danaher's *Automation and Utopia* (2019) and Bernard Suits's *The Grasshopper* supply the alternative: in a solved world, the residual valuable activities are those whose whole point is the obstacle. **Workism as a life strategy is bankrupt** — Derek Thompson's 2019 thesis that elite professionals have made work into religion looks worse from every direction in the AI era.

### Time architecture under AI

Ashley Whillans's research on time affluence (SPPS 2016, PNAS 2017, *Science Advances* 2019) shows that valuing time over money predicts greater life satisfaction across major transitions, but **time affluence per se is not enough** — predictors of using freed time well are prior orientation toward time as intrinsically valuable, social and relational use, and activities meeting psychological needs. Without these, time freed by AI is absorbed into more work or distraction. Oliver Burkeman's *Four Thousand Weeks* and Cal Newport's *Slow Productivity* supply the practical principles: protect deep work blocks; use AI for shallow tasks; concentrate identity in projects whose value comes from sustained, integrated thinking AI cannot yet do.

### Practical frameworks for life architecture

Four frames emerge from the analytical discourse:

**Complement-the-AI** (Brynjolfsson, Mollick, Cowen): AI as the most capable junior colleague; your role is judgment, integration, taste, accountability. Strategic implication: invest in domain depth plus AI literacy; remain at the moving frontier.

**Lean-into-deeply-human** (Hoel, Newport, Klein, Sacasas): concentrate identity and time on activities where humanness is constitutive. Strategic implication: rebalance toward care, community, craft, citizenship.

**Personal R&D / generalist** (Karlsson, Cowen, Mollick): life as a portfolio of experiments because the half-life of specific skills is short. Strategic implication: build identity around the experimenting agent rather than any specific expertise.

**Post-instrumental / atelic** (Setiya, Suits, Bostrom, Danaher, Hägglund, Burkeman): telic productive contribution will shrink as a meaning-source. Strategic implication: build atelic ballast now, while productive meaning still works, so the transition is not crisis-driven.

These are not mutually exclusive. The strongest practical posture combines them by time horizon: near term, complement the AI; medium term, lean human; always, experiment; throughout, build atelic ballast.

---

## Adversarial challenge: Is this framing wrong?

The strongest objection to this review's organizing thesis — that the AI transition is primarily relational/existential with labor-economics side effects — runs as follows:

**The material-primacy counter-thesis:** Material conditions constrain and determine relational and existential ones, not the reverse. If someone loses their income, their relationships collapse, their sense of meaning evaporates, and the philosophical frameworks in Section 3 become irrelevant luxuries. The causal arrow runs economics → relationships → meaning, and treating them as co-equal domains with independent leverage is a class-position artifact — it's advice that works for people whose economic floor is already secure. For a freelance writer whose income dropped 30% in 2024, or an entry-level developer who can't get hired, the labor economics *is* the existential question. Telling them to "diversify identity sources" while their rent is threatened is tone-deaf.

**Where this counter-thesis is right:** It is substantially correct for anyone whose material floor is insecure. The document's framing implicitly assumes a decision-maker with enough economic runway to make strategic choices about meaning and relationship investment. For those without that runway, the labor-economics section is not the least decision-relevant — it is the most. The distributional findings (Section 1) reinforce this: the people most disrupted are entry-level workers, freelancers, and those in AI-substitutable creative work. The relational/existential framework is most useful to the knowledge worker with stable employment who is navigating identity disruption; it is least useful to the person whose economic foundation is crumbling.

**Where this counter-thesis fails:** First, even for economically secure individuals, the relational and meaning dimensions are where the most underappreciated risks lie — precisely because the labor-economics conversation gets disproportionate attention in public discourse while the slower-moving relational depletion and meaning erosion go unnamed. Second, historical evidence suggests that economic disruption without relational/meaning infrastructure leads to deaths of despair (Case and Deaton 2015, 2020), not just temporary hardship — the interaction effects between material loss and meaning-loss are multiplicative, not additive. Third, among the specific population most disrupted — young knowledge workers — the material and existential are entangled: the entry-level ladder is breaking *and* AI is offering substitutive emotional support *and* professional identity is being destabilized simultaneously. They cannot be parsed.

**The surviving thesis, qualified:** The framing holds for individuals whose material floor is secure, where the relational and meaning dimensions are genuinely where individual strategic leverage is highest and where most of the discourse is weakest. For individuals whose material floor is insecure, the labor economics is primary and the other two follow. For everyone, the interaction across all three domains is where the real risk concentrates — and the cognitive offloading dynamic (Section 1) cuts across all three simultaneously.

---

## Load-bearing assumptions (crux identification)

The document's conclusions depend on specific assumptions that, if wrong, would change the recommendations. Naming them honestly:

**1. AI capability continues to advance but does not achieve full economic substitution within 5 years.** If Aschenbrenner-style intelligence explosions produce AGI by 2028, most of the strategic advice here (skill investment, career planning, identity diversification) is rendered moot. The convergent advice survives — relationships, atelic meaning, embodied presence retain value even under AGI — but the timeframe collapses. *Evidence that would flip this: sustained exponential improvement in agentic capability benchmarks through 2027, successful autonomous scientific research, AI systems replacing entire job functions rather than tasks.*

**2. Human relationships provide something structurally irreplaceable that AI cannot approximate even with substantially better models.** The philosophical argument for this is strong (embodiment, mutual vulnerability, witness, transformation), but it is possible that many humans will *not care* about the structural difference — that subjective satisfaction with AI relationships, even if philosophically impoverished, will be functionally sufficient for a large population. If so, the substitution-risk framing overstates the problem. *Evidence that would flip this: longitudinal data showing AI-relationship-heavy individuals achieving equivalent health, longevity, and wellbeing outcomes to those with strong human relationships. The Waldinger data predicts they won't, but we don't have the direct test.*

**3. Cognitive offloading effects are real and cumulative rather than a temporary adjustment.** The deskilling evidence is still cross-sectional or short-duration. It is possible that, like calculators, AI offloading produces short-term deskilling in narrow domains while freeing cognitive capacity for higher-order work. If the calculator analogy holds, the deskilling concern is overstated. *Evidence that would flip this: longitudinal studies showing stable or improved expert judgment and critical thinking after 2+ years of heavy AI use.*

**4. The "depleted relational environment" is a structural rather than cyclical phenomenon.** Thompson's Anti-Social Century thesis treats declining sociality as driven by long-run structural forces (suburbanization, individualization, technology substitution for in-person activity). If it is partly cyclical or generational — if Gen Z's high loneliness produces a counter-movement toward in-person community — then the depletion baseline is less dire than framed here. *Evidence that would flip this: reversal of time-use trends toward in-person socializing, rising institutional affiliation among young adults.*

**5. The telic-atelic distinction maps cleanly onto AI's effects on meaning.** Setiya's framework was developed for midlife crisis, not technological disruption. It's possible that AI disrupts atelic activities too — if AI companions change the phenomenology of friendship, if AI art changes the experience of aesthetic contemplation, if AI parenting aids change the felt quality of caregiving. If atelic activities are also degraded by AI proximity, the "build atelic ballast" advice loses its structural basis. *Evidence that would flip this: qualitative or phenomenological research showing AI-augmented atelic activities feel less meaningful to participants.*

---

## Cross-domain synthesis — what the literature actually tells the decision-maker

Five integrative claims survive the full review, including the adversarial challenge.

**The strategic risk is asymmetric across domains, modulated by material security.** For the economically secure: labor markets are moving slower than discourse suggests but breaking the apprenticeship ladder; relationships are restructuring into depletion; meaning is where individual leverage is highest. For the economically insecure: the labor disruption *is* the meaning crisis. For everyone: cognitive offloading cuts across all three, and the interaction effects are multiplicative.

**Identity diversification is the convergent empirical recommendation.** It is supported by the work-identity-threat literature, Schnell's sources-of-meaning research, Park's meaning-making model, and the foreclosure-versus-achievement framework. It hedges across nearly every AI trajectory and addresses the largest empirically supported risk factor.

**The telic-atelic distinction, fitting fulfillment, and SDT-plus-beneficence form a usable conceptual toolkit.** Setiya tells you which meaning-bearers AI evacuates and which it does not. Wolf tells you how to evaluate whether a pursuit is meaning-preserving. SDT-plus-beneficence tells you which psychological needs are most threatened (competence, on both symbolic and actual axes via cognitive offloading) and which are robust (autonomy, relatedness, beneficence).

**Embodied co-presence, mutual vulnerability, witness, and transformative shared time are what AI cannot supply structurally.** This is the conjunction of phenomenology, attachment theory, philosophy of friendship, longitudinal evidence, and the empirical literature on touch and nervous-system co-regulation. Whatever an AI relationship is, it is a different object — and in an already-depleted relational environment, the displacement risk runs in one direction.

**The honest epistemic posture distinguishes findings, forecasts, and interpretations.** Eloundou, Brynjolfsson-Li-Raymond, Dell'Acqua-Mollick, the Therabot RCT, the OpenAI-MIT longitudinal study, Brynjolfsson-Chandar-Chen, Hui-Reshef-Zhou, and the Anthropic Economic Index are **findings**. Aschenbrenner's Situational Awareness, AI 2027, and most intelligence-explosion content are **forecasts**. The Cowen, Smith, Thompson, Karlsson, Sacasas, Setiya-applied-to-AI material is **interpretation**. Treat them differently.

---

## Key researchers and labs

**Labor economics of AI:** Daron Acemoglu (MIT), David Autor (MIT), Erik Brynjolfsson (Stanford Digital Economy Lab), Pascual Restrepo (Boston U), David Deming (Harvard), Anton Korinek (UVA/Brookings), Daniel Rock (Wharton), Tyna Eloundou (OpenAI), Xiang Hui (WashU Olin), Lawrence Schmidt (MIT), the Anthropic Economic Index team.

**AI and productivity/organizations:** Ethan Mollick (Wharton), Fabrizio Dell'Acqua (HBS), Lindsey Raymond (MIT/Cornell), Danielle Li (MIT Sloan), Karim Lakhani (HBS).

**AI companionship and mental health:** Julian De Freitas (HBS), Patti Maes and colleagues (MIT Media Lab), Adam Miner (Stanford), Marita Skjuve (SINTEF), Laestadius and colleagues (IU Indianapolis), Matthew Heinz (Dartmouth/Therabot), Sherry Turkle (MIT), Shannon Vallor (Edinburgh).

**Cognitive offloading and deskilling:** Upol Ehsan (Georgia Tech), Mark Riedl (Georgia Tech), Michael Gerlich (SBS), Natalya Kosmyna (MIT Media Lab), Lisanne Bainbridge (classic ironies-of-automation framework), Prakash Shukla and Paul Parsons (Purdue, AI-assisted design).

**AI and inequality/distributional effects:** Acemoglu and Restrepo (MIT/BU), Oxford AI Governance Institute (Agentic Inequality team), Klaus Prettner (Vienna), IMF AI and Inequality team.

**Meaning, identity, and philosophy:** Kieran Setiya (MIT), Susan Wolf (UNC), Martin Hägglund (Yale), L.A. Paul (Yale), Agnes Callard (Chicago), Frank Martela (Aalto), John Danaher (Galway), Nick Bostrom (Oxford), Crystal Park (UConn), Tatjana Schnell (Innsbruck).

**Public intellectuals / longform analytical:** Tyler Cowen (George Mason / Marginal Revolution), Noah Smith (Noahpinion), Derek Thompson (Atlantic), Ethan Mollick (One Useful Thing), Henrik Karlsson (Escaping Flatland), L.M. Sacasas (The Convivial Society), Zvi Mowshowitz, Cal Newport, Oliver Burkeman, Dwarkesh Patel (interviews).

---

## Recommended reading list

**For labor and career:** Eloundou et al. "GPTs are GPTs" (*Science* 2024); Brynjolfsson, Li and Raymond "Generative AI at Work" (*QJE* 2025); Dell'Acqua, McFowland, Mollick et al. "Navigating the Jagged Technological Frontier" (*Organization Science* forthcoming); Acemoglu "Simple Macroeconomics of AI" (NBER 32487); Autor "Applying AI to Rebuild Middle Class Jobs" (NBER 32140); Brynjolfsson, Chandar and Chen "Canaries in the Coal Mine?" (Stanford, November 2025); Hui, Reshef and Zhou "Short-Term Effects of Generative AI on Employment" (*Organization Science* 2024); Teutloff et al. "Winners and Losers of Generative AI" (*JEBO* 2025); Mollick *Co-Intelligence* (2024).

**For cognitive offloading and deskilling:** Gerlich "AI Tools in Society" (2025); Stadler, Bannert and Sailer (2024); Kosmyna et al. MIT neuroimaging study (2025); Shukla et al. "Ironies of AI-Assisted Design" (CHI EA 2025); arXiv:2601.21920 (cancer specialists/intuition rust); arXiv:2603.26707 (cognitive divergence review, 2026).

**For relationships:** Heinz et al. Therabot RCT (*NEJM AI* 2025); Fang, Phang et al. OpenAI-MIT longitudinal RCT (arXiv:2503.17473); De Freitas et al. on identity discontinuity (HBS 25-018) and emotional manipulation (HBS 26-005); Moore et al. Stanford FAccT 2025; Common Sense Media / Stanford Brainstorm 2025 adolescent survey; Turkle *Reclaiming Conversation*; Vallor *The AI Mirror*; Putnam *Bowling Alone*; Thompson "The Anti-Social Century"; Waldinger and Schulz *The Good Life*; Coan's Social Baseline Theory; Paul *Transformative Experience*.

**For meaning:** Setiya *Midlife*; Wolf *Meaning in Life and Why It Matters*; Hägglund *This Life*; Arendt *The Human Condition*; Martela, Ryan and Steger's four-pathway paper (*Journal of Happiness Studies* 2018); Park's meaning-making papers; Schnell's sources-of-meaning research; Bostrom *Deep Utopia*; Burkeman *Four Thousand Weeks*; Newport *Slow Productivity*.

**For the longform analytical frontier:** Mollick "Centaurs and Cyborgs on the Jagged Frontier" (September 2023); Amodei "Machines of Loving Grace" (October 2024); Aschenbrenner "Situational Awareness" (June 2024); Kokotajlo et al. "AI 2027" (April 2025); Thompson "The Anti-Social Century" (February 2025); Anthropic Economic Index reports; Cowen "Context is that which is scarce"; Dwarkesh Patel interviews with Shulman, Cowen, Douglas-Bricken; Karlsson "The Learning System"; Sacasas "Apocalyptic AI."

---

## Conclusion — the structural bet

The deepest point in this literature is that **the AI transition operates across work, relationships, and meaning simultaneously, with each domain feeding back into the others** — and the largest risk is optimizing for one domain while neglecting the others, or treating the transition as primarily about whichever domain one happens to work in.

The strongest summary recommendation the literature supports: **architect a life that performs well across the uncertainty.** Build atelic ballast while telic activities still feel meaningful. Diversify identity sources before any one is destabilized. Protect embodied, in-person relationships as a non-negotiable category. Treat heavy daily emotional reliance on AI as the dose-dependent risk the empirical evidence shows it to be. Monitor cognitive offloading — maintain effortful practice in domains that matter to you, even when AI makes it optional. Concentrate professional development on judgment, taste, accountability, and managerial framing rather than substitutable production skills. Read the slow camp's data and the fast camp's forecasts and act on the convergent advice rather than the divergent predictions.

The convergent advice is striking: nearly every serious source — across labor economics, attachment theory, philosophy of meaning, longitudinal psychology, and longform analysis — points in the same direction. Identity diversification, deep relationships, contribution to specific others, time-orientation over money-orientation, genuine engagement with what AI cannot yet do, and active maintenance of cognitive capacity through effortful practice. None of these depend on resolving the AI-trajectory question. They are the bet that pays under every scenario the literature can credibly describe.

---

## Next moves — preparing for topology/graph stage

This lit review is the input to the next pipeline stage: mapping the structure of what was found as a graph/topology. Three candidate approaches for that stage, in order of likely value:

**1. Causal-mechanism graph.** Map the causal relationships across all three domains as a directed graph. Nodes would be variables (AI capability, cognitive offloading, relational depletion, competence frustration, identity foreclosure, material security, etc.) and edges would be causal/mediating relationships with empirical confidence levels. This would make the interaction effects — which are the most decision-relevant finding — visually tractable and would reveal which nodes have the highest betweenness centrality (i.e., which variables mediate the most pathways). The cognitive offloading node likely emerges as a high-centrality hub connecting work, relationship, and meaning domains.

**2. Evidence-quality topology.** Map the literature itself as a graph: papers as nodes, citation relationships as edges, color-coded by domain (work/relationships/meaning) and evidence type (finding/forecast/interpretation). This would make the structural gaps visible — the thin citation bridges between domains, the isolated clusters, the asymmetric evidence densities. It would also surface which papers are structurally foundational (high in-degree) versus which are interpretive endpoints.

**3. Decision-tree / scenario graph.** Map the crux assumptions (Section 5) as branching nodes that lead to different strategic recommendations depending on which way they resolve. This would make the "architect for uncertainty" recommendation concrete by showing which branches converge on the same advice and which diverge.

---

## Topology
*topic: navigating-ai-world · stage: topology · pass 5 · complete*

Dependency graph of the AI-transition lit review. Five foundational-assumption cruxes, four reframer mechanisms (the cognitive-offloading bridge, telic exhaustion, apprenticeship-ladder break, engagement-optimized substitution), three logical guardrails, seven strategic recommendations forming the high-leverage core, and four distortion vectors. Cognitive offloading (G4) is the only mechanism that produces effects across all three domains under a single generating function.

## TLDR

The lit review documents what the literature says about how AI restructures work, relationships, and meaning. This topology asks the structural question underneath: **what depends on what?** The field collapses to roughly fifty load-bearing claims across eight types (assumptions, empirical findings, mechanisms, guardrails, strategic recommendations, frameworks, open questions, distortions — see the legend on the graph). The graph encodes their dependencies so that the cruxes, the contested mechanisms, and the practical handles can each be read off cleanly without conflating them.

**Scope.** This topology takes the *individual decision-maker* as the unit of analysis — someone navigating career, relationships, and meaning architecture in a world where AI capability is accelerating. Institutional design (how organizations should structure AI integration), policy and governance (regulation, antitrust, AI safety), and aggregate-society dynamics (labor-share macro, geopolitics, civilizational risk) are out of frame. They affect the structural backdrop the individual operates in, but the strategic recommendations here are addressed to people, not to firms or governments. **Population caveat:** the empirical anchors (Brynjolfsson-Chandar-Chen ADP, OpenAI-MIT chatbot RCT, Therabot, Common Sense adolescent survey, Anti-Social Century, Acemoglu-Humlum-Vestergaard Danish nulls) are almost entirely US or Western. The strategic recommendations may not generalize cleanly across cultures with thicker baseline community, different family structures, or different work-identity expectations — readers in those contexts should treat the recommendations as hypotheses, not findings, until tested locally.

The single most useful conceptual move in this topology is **separating the foundational cruxes from the reframer mechanisms from the logical guardrails**. The foundational cruxes (A1 timelines, A2 relationship irreplaceability, A3 cumulative cognitive offloading, A4 structural relational depletion, A5 telic/atelic mapping) are the assumptions that, if falsified, would force rebuilding regions of the picture; A2 and A5 are most contestable, A3 has the most active falsification window (O3), A1 is the live-discourse fault line. The **reframer mechanisms** (G3 engagement-optimized substitution, G4 cognitive offloading bridge, G5 apprenticeship-ladder break, G7 telic exhaustion) don't break the picture if they reverse — they change what the empirical findings *mean*, and their precise magnitudes are where the field is moving fastest. The **logical guardrails** (L1 finding/forecast/interpretation, L2 material-floor primacy, L3 substitution-vs-complement-is-the-wrong-binary) cannot be falsified — they can only be ignored, which is how most surface-level discourse on this topic proceeds.

The structural finding of the topology is that **cognitive offloading (G4) is a cross-domain bridge node — the only mechanism that produces effects across all three domains under a single generating function.** Practice atrophy when an effortful task is delegated produces deskilling at work (E16, E17), drives the competence-frustration that destabilizes meaning (G10), and erodes the relational depth that comes from sustained attention to another person's emotional state. This is why S6 (maintain effortful practice) shows up as load-bearing in three different parts of the synthesis even though it looks like a workplace-productivity tip on its surface.

The field's **weakest links** are not where public discourse focuses heat. The settled findings (E1 productivity gains, E14 Anti-Social Century baseline, E11 OpenAI-MIT dose-response, E10 Therabot RCT) are robust enough to bet on. The actual fragile zones in 2026: the cumulative-vs-calculator-analogue question for cognitive offloading (O3 falsifies A3 if it resolves toward calculator); whether asymmetric-adoption couples produce the relational-erosion patterns the dose-response findings predict (O2, the single largest empirical gap in the literature); whether AI-augmented atelic activities feel less meaningful (O4 falsifies A5 / S3 if it resolves yes); the cross-sectional-only nature of A3's evidence base; the regulation-contingent stability of G3 (engagement-optimization could be moderated by policy); and the scope ambiguity of "AI relationships" (companion apps vs. general-purpose assistants used relationally vs. fine-tuned therapy bots are three different objects with different trajectories). On labor specifically, **the optimist Ricardian story (G2) is currently less defensible than the apprenticeship-break story (G5) given E4 and E5** — not "contested" in the symmetric sense but actively losing ground as evidence accumulates. This topology is the input to model formalization (Stage 3); the cleanest formalization target is the **competence-frustration / atelic-ballast trade-off** — a quantified model of how much meaning a person loses per unit of telic work AI absorbs, partitioned by how much atelic infrastructure they had built before the shock.

## The graph

<AITransitionGraph client:load />

*Click a node for its claim, status, and load-bearing weight; hover an edge to see the relation type. Drag nodes to rearrange, drag empty space to pan, scroll to zoom. The variant toggles read the same graph through five lenses (full / vulnerability / flow / minimal / decision-leverage).*

---

## How to read this graph

Every claim in the lit review collapses to one of eight node types. Edges between them carry one of eight relations. Together they make the structure inspectable.

### Node types

- **A — Assumption.** A foundational claim the rest of the picture leans on. Falsification would force rebuilding regions of the topology. The five A nodes are this stage's cruxes.
- **E — Empirical finding.** A claim with direct empirical support — usually a number from a specific study or convergent meta-finding.
- **G — Generating mechanism.** The underlying dynamic that produces the empirical pattern. G nodes are the "why" answers behind E nodes; they're where individual decisions can intervene because they expose the leverage point.
- **L — Logical guardrail.** A definitional or framing constraint that holds regardless of the empirical situation. L nodes cannot be falsified — they can only be ignored. They constrain how downstream claims can be interpreted.
- **S — Strategic recommendation.** The synthesis of empirical findings + mechanisms + frameworks into something a decision-maker can act on. S nodes form the high-leverage cluster in the leverage variant.
- **F — Framework.** A philosophical or conceptual tool the topology imports from outside the AI-discourse literature (Setiya, SDT, Arendt). F nodes are the lenses that make the mechanisms legible.
- **O — Open question.** A claim that would be load-bearing if resolved. O nodes carry the falsification windows for the cruxes — they show what evidence would change the picture.
- **D — Distortion.** A selective reading that ignores parts of the evidence base in service of a prior commitment. D nodes are not "wrong arguments" — they're motivated readings that target specific E / S nodes.

### Edge types

- **dep** (depends on): X requires Y to hold for X's claim to be valid.
- **sup** (supports): Y is empirical or logical evidence corroborating X.
- **gen** (generates): Y produces X as its empirical signature.
- **imp** (implies): Y logically constrains how X can be interpreted.
- **conf** (confounds): Y creates an artifact or counter-pressure that complicates X.
- **mod** (moderates): Y changes the magnitude or direction of X without negating it.
- **bridges** (cross-domain): The same mechanism produces effects in two or more domains simultaneously. The blue dashed edges are this topology's structural finding.
- **attacks** (selectively reads): a distortion vector targets X by ignoring or dismissing it.

---

## Cruxes

The five A-nodes are a faithful transcription of the lit review's own "Load-bearing assumptions" section — these are the assumptions the literature *itself* rests on, not a curated subset chosen for narrative shape. The five foundational assumptions are not equally uncertain, however. Naming the difference matters, because public discourse routinely conflates "this is a working assumption that has held so far" with "this is contested and a coin flip" — which leads to either complacency or fatalism depending on which assumption is in question.

**A1 — AI advances without full substitution by ~2030.** The live-discourse fault line. The slow-camp evidence (Acemoglu, Humlum-Vestergaard, the J-curve dynamics, Anthropic Economic Index aggregate task shares) supports A1; the fast-camp forecasts (Aschenbrenner, AI 2027, Korinek-Suh) target it. A1 has the most discourse heat, but **falsification has asymmetric consequences across the S cluster**. Under fast-AGI scenarios: S7 (career bet) collapses entirely — its targets (judgment, taste, AI-managerial framing) all become AI-substitutable. S2 (identity diversification), S3 (atelic ballast), S4 (in-person relationships), S5 (dose-limit), S6 (effortful practice) all survive — but their *meaning* shifts: they become more important, not less, because humans need their own practice ground when the entire knowledge-work surface is AI-saturated. The asymmetry is not "planning is wasted under fast timelines" — it is "skill-investment recommendations collapse, meaning-architecture and relational-infrastructure recommendations strengthen."

**A2 — Human relationships provide irreplaceable goods.** The most philosophically defensible but functionally contestable. The structural argument (G8 embodied co-regulation + G9 mutual vulnerability + Vallor mirror-not-other) is strong; the contestation runs through whether *most humans will care* about the structural difference. If subjective satisfaction with AI relationships ends up functionally sufficient for a large population, the substitution-risk framing of E11 / E13 overstates the problem. This is the assumption most likely to be falsified by changing user behavior rather than by new evidence.

**A3 — Cognitive offloading is cumulative, not transient.** The crux with the most active falsification window. The cross-sectional evidence (E16 Gerlich-Stadler-Kosmyna, E17 Ehsan intuition rust) is suggestive but cross-sectional and short-duration. O3 — calculator-analogue versus cumulative atrophy — is the single best-defined empirical question in this topology, and resolving it within ~3 years is plausible. **If A3 falsifies, S6 (maintain effortful practice) loses its evidential basis** — though it would still hold as a hedge.

**A4 — Relational depletion is structural.** Carries the depleted-baseline that makes E11 / E13 / E14 substantially more dangerous than they would be in a thicker relational environment. If Gen Z's high loneliness produces a counter-movement toward in-person community, A4 weakens. The Thompson Anti-Social Century data is the strongest evidence; it would take ~5 years of trend-reversal data to overturn.

**A5 — Telic/atelic maps onto AI's effects on meaning.** The most novel-application crux. Setiya's distinction was developed for midlife crisis; importing it to AI-meaning-disruption is a load-bearing hypothesis, not an empirical finding. O4 — whether AI-augmented atelic activities feel less meaningful — is the falsification window. If atelic activities are also degraded by AI proximity (AI companions changing the phenomenology of friendship, AI art changing aesthetic contemplation), S3 (atelic ballast) loses its structural basis.

---

## Reframer mechanisms

Below the cruxes sit four mechanisms whose magnitudes — not whether they exist — determine what the empirical findings actually mean. They are this topology's most active research frontier. Reversing them would not falsify the picture; it would recolor it.

**G3 — engagement-optimized AI selects for substitution.** Muldoon-Park's structural argument: companion-app commercial incentives are the same as engagement-optimized social media — profit by prolonging the loneliness they purport to alleviate. If this mechanism dominates (default trajectory), E11 (OpenAI-MIT dose-response) and E12 (identity-discontinuity causal harm) get worse, not better, as the technology improves. If regulation, consumer demand, or shipped product changes weaken G3, the same models could shift toward complementarity rather than substitution. **G3's stability is contingent on commercial design choices, not on the underlying capability** — which makes it the mechanism most directly affected by policy.

**G4 — cognitive offloading via practice atrophy.** The cross-domain bridge. Same mechanism, three downstream domains: work-deskilling (E16, E17), relational-depth erosion (S4), meaning-architecture disruption via competence-frustration (G10). If A3 falsifies in the calculator-analogue direction, G4's downstream causal weight collapses across all three domains simultaneously — which is why O3 (the falsification window for A3) is the single most consequential open question in this topology.

**G5 — apprenticeship-ladder break.** AI absorbs entry-rung tasks → no rung-1 → expert pipeline collapses. Distinct from full-occupation substitution: only the bottom rungs get automated, but the bottom rungs were the training ground for everyone above them. This is the mechanism behind E4 (Brynjolfsson-Chandar-Chen entry-level disruption) and Shulman's friction-cost argument against simple Ricardian comparative advantage. **G5 is currently strengthening, not weakening, as E4 and E5 evidence accumulates** — making it the reframer mechanism with the most directional momentum.

**G7 — telic exhaustion.** Setiya: telic activities self-annihilate on completion. When AI completes telic projects in seconds, the share of meaning staked on telic completion shrinks. This mechanism has the cleanest theoretical structure (the telic/atelic distinction either maps or it doesn't — see A5) and the murkiest phenomenology (do atelic activities really stay untouched?). O4 is the falsification window. If G7 is correct, S3 (atelic ballast) is the highest-leverage strategic move available; if G7 fails because atelic is also degraded, the entire meaning-architecture half of this topology needs reconstruction.

The four reframers cluster the topology's *active uncertainty*. The cruxes are where the discourse heat is; the reframer magnitudes are where the actual research is moving.

---

## Weakest links

Six pressure points where the topology would crack or shift if pushed:

**1. G2 — comparative-advantage equilibrium under binding constraints.** Smith's optimist case (AI faces *some* binding constraint → comparative advantage holds → Ricardian wage stability) is contested by Shulman's friction-cost argument (humans introduce contamination / insurance / coordination friction → equilibrium wage to zero). G5 (apprenticeship-ladder break) is the empirical mechanism Shulman names — and G5 is currently strengthening as E4 / E5 evidence accumulates. **The optimist Ricardian story is currently less defensible than the apprenticeship-break story given the available evidence** — not "contested" in the symmetric sense but actively losing ground. Anyone betting on Autor-style middle-class-jobs reconstitution should require G5 to weaken first.

**2. A3's evidence base is cross-sectional only.** The cognitive-offloading evidence cluster (E16 Gerlich-Stadler-Kosmyna, E17 Ehsan intuition rust) is convergent but methodologically thin: cross-sectional correlations and short-duration interventions. The 2+ year longitudinal study that would actually test A3 (calculator-analogue versus cumulative atrophy) doesn't exist yet. If S6 (effortful practice) is the highest-leverage recommendation in the topology because of its bridge position, then **the highest-leverage recommendation is being grounded by the methodologically weakest evidence layer**. The hedge logic survives — practice is cheap insurance — but the strong claim that AI offloading durably erodes capacity remains a working hypothesis.

**3. O2 — asymmetric-adoption couples.** No peer-reviewed quantitative evidence yet exists on outcomes for couples where one partner uses AI heavily and the other does not. The single largest empirical gap in the literature. The technoference literature (McDaniel-Coyne 2016+) is the closest analogue but doesn't capture the third-party-AI structure. Whoever runs the first such study will produce the most decision-relevant relational-AI finding of the next three years.

**4. G3's stability depends on commercial design choices.** The engagement-optimized-substitution mechanism is currently the default trajectory of consumer AI products, but it is contingent — regulation (EU AI Act companion-app provisions, FTC enforcement), consumer demand shifts, or competitor product design could moderate it. Unlike G4 (which is grounded in cognitive psychology) or G7 (grounded in philosophy), G3 is a feature of the current commercial environment. **The topology's E11 / E12 / E13 cluster gets worse under G3-dominance and better under G3-moderation** — these are not fixed empirical facts about AI but path-dependent outcomes.

**5. "AI relationships" scope ambiguity.** The empirical literature treats companion apps (Replika, Character.AI), general-purpose assistants used relationally (ChatGPT for emotional processing), and fine-tuned therapy bots (Therabot) as one object. They are three different objects with different generating mechanisms, different commercial incentives, and probably different long-run trajectories. **The topology's relationship-domain claims are sharpest for companion apps, fuzzier for general-purpose assistants, and possibly inverted for therapy bots** — Therabot's clinical-grade benefit (E10) coexists with companion-apps' dose-response harm (E11) because they are not the same kind of thing.

**6. D3 — productivity-only optimization.** The most common distortion among the demographic most likely to be reading this topic (knowledge workers in AI-adjacent fields). Treats relational and meaning consequences as out-of-scope; ignores G4's cross-domain bridge structure. The topology's strongest defense against D3 is the bridge-edge cluster (G4 → S2 / S4 / S5) — if the reader cannot dismiss the bridge structure, they cannot consistently read this topology as a productivity-optimization frame. But the bridge claim leans on A3, which is fragile (see #2 above) — so D3's defense is downstream of A3's evidential strength.

---

## Cross-domain bridges — the structural argument

The Reframer-mechanisms section names G4 as a cross-domain bridge. This section spells out the three channels and what is and isn't supported by direct evidence in each.

1. **G4 → work-deskilling (E16, E17).** Practice atrophy in analytical work → reduced critical thinking, intuition rust. Direct evidence: Gerlich N=666 (cross-sectional correlation), Stadler-Bannert-Sailer (controlled comparison, lower-quality arguments under ChatGPT-aided research), Kosmyna MIT neuroimaging (reduced neural engagement under LLM-assisted writing), Ehsan year-long cancer-specialist field study (intuition rust in expert judgment). All cross-sectional or short-duration; longitudinal evidence does not yet exist (Weakest link #2).
2. **G4 → relational-depth erosion (S4).** Practice atrophy in *emotional* processing — outsourcing venting, sense-making, conflict rehearsal to AI — would erode capacity for sustained attention to another's emotional state, tolerance of ambiguity in live conflict, sitting with unresolved tension. **No direct empirical evidence yet exists for this channel.** The mechanism is parallel to analytical deskilling, and the technoference literature (McDaniel-Coyne 2016+) shows phone-presence harms in-person interactions — but technoference is attention-split, not delegation, so it is not direct evidence for the offloading-eroding-emotional-capacity claim. This channel is currently a parallel-mechanism inference, not a finding. O2 (asymmetric-adoption couples) is the first study that would actually test it.
3. **G4 → meaning architecture (G10, S2).** Competence-frustration in SDT terms is threatened *actually* (not just symbolically) when offloading erodes the underlying skill — but this channel piggybacks on the work-deskilling evidence (channel 1). Identity foreclosure becomes more brittle when the foreclosed identity is no longer a domain you can practice. The phenomenology is well-described in interpretive sources (Balwit-Cowen, Mollick); empirical studies operationalizing it directly are scarce.

**Honest summary of the bridge claim's evidence base:** Channel 1 has direct (if cross-sectional) evidence; channels 2 and 3 are parallel-mechanism inferences from channel 1. The bridge structure is the topology's most consequential conceptual move and also its most empirically thin link — the strategic recommendation S6 hedges all three channels under one practice precisely because the bridge is plausibly load-bearing despite the evidence gap.

---

## Variant readings

Each variant toggle reads the same graph through a different lens.

**Full** — all 50 nodes, all 80+ edges. The default view. Useful for orienting and for finding specific nodes you want to inspect.

**Vulnerability** — highlights the 5 cruxes plus weight-5 load-bearing nodes. Useful for thinking about what evidence would shift the picture and what advice survives even under crux-falsification. The takeaway is asymmetric: most strategic recommendations survive even if A1 falsifies fast (timelines collapse but convergent advice holds); strategic recommendations contingent on A3 / A5 are more fragile because they lean on hypotheses the field is actively testing.

**Flow** — A → G → E → S cascade with cross-domain bridges. Reads the topology causally: assumptions ground mechanisms, mechanisms produce empirical findings, findings drive synthesis, bridges connect the domain-specific stories. This is the variant that makes G4's bridge role most visible.

**Minimal** — 15 nodes that recover the qualitative integrated picture. Removing any one breaks the qualitative shape. The minimal set is the answer to "if I had to fit this on one page, which 15 nodes?" — it concentrates on the cruxes, the bridge node, the load-bearing empirical findings, the material-floor guardrail, and the high-leverage strategic recommendations.

**Decision-leverage** — saturated nodes are high individual leverage (the seven strategic recommendations — direct decisions you can make about how to live). Mid-tone are medium leverage (mechanisms G4 / G6 / G7 / G10 and the discipline L1, plus the three frameworks — mental models that make the strategic recommendations applicable to your specific life). Faded nodes are structural or outside individual control (cruxes you can't move, empirical findings you can't change, mechanisms operating at population scale, distortions that affect public discourse). The variant answers "what should I actually do, and what do I need to understand to do it?" — the S cluster is the doing layer; the medium-leverage cluster is the understanding layer; everything else is the world the doing happens in.

---

## Distortions

Four distortion vectors selectively read the same evidence base:

**D1 — AGI-cancels-planning fatalism.** Targets the entire S cluster. Move: assume A1 falsifies fast, conclude planning is wasted. The defense: the convergent S recommendations (S2 identity diversification, S3 atelic ballast, S4 in-person relationships, S6 effortful practice) survive even fast timelines. They are about meaning architecture and relational infrastructure, not about specific skill bets. Only S7 (career bet) is sensitive to A1.

**D2 — slow-camp dismissal.** Targets E4, E5, E16. Move: cite Acemoglu-Humlum-Vestergaard aggregate nulls (E8) as evidence that "nothing is happening." Ignores that disruption is concentrated, not diffuse — entry-level (E4), freelance (E5), and creative work are leading indicators while aggregate effects lag. The defense: the *coexistence* of E4/E5 with E8 is the actual pattern, not their contradiction. G1 (Hulten/task-exposure aggregation) explains why aggregate effects are slow; G5 (apprenticeship-ladder break) explains why entry-level effects are sharp. Both can be true.

**D3 — productivity-only optimization.** Targets E11, E14, S4, S5. Move: optimize for AI-augmented output throughput; treat relational and meaning consequences as out-of-scope. The defense: G4 cross-domain bridges. The reader cannot consistently dismiss relational and meaning consequences while accepting the productivity gains, because the same offloading mechanism produces both.

**D4 — material-blind class-position bias.** Targets L2, E4, E5. Move: assume the decision-maker has economic runway; treat labor disruption as marginal versus relational/meaning advice. Inverts which advice is decision-relevant for whom. The defense: L2 (material-floor primacy). The framing in this topology is most useful for the knowledge worker with stable employment; for those whose floor is crumbling, S7 precedes S2-S6 and the labor economics is the existential question.

---

## Adversarial + steelman

Distortions are how others read the picture badly. The honest move is to also ask: *is my own picture wrong?* The strongest reasonable critique of this topology — not a motivated misreading but a serious objection — is that the relationship and meaning sections are systematically over-weighted toward harm and under-weighted toward upside.

**Objection.** This topology's relationship-domain cluster (E10–E15) leans heavily on companion-app pathology (Replika identity-discontinuity, OpenAI-MIT dose-response, adolescent companion uptake) and the depleted-baseline framing (Anti-Social Century, Surgeon-General loneliness mortality). It treats Therabot's clinical-grade benefit (E10) as the lone counterweight rather than as the leading edge of a much larger therapeutic-AI category. Symmetrically, in the meaning domain, G7 (telic exhaustion) and G10 (competence-frustration) are framed as threats, but the same mechanism that compresses telic-completion meaning also dramatically expands the *accessibility* of telic projects to people previously excluded by skill or capital constraints (the "novice-skill compression" finding E2 hints at this but the topology doesn't carry it forward). On cognitive offloading, the topology treats the cumulative-atrophy interpretation of A3 as the working hypothesis and the calculator-analogue as the contestation, when an honest reading of the evidence base could equally well treat them as a coin-flip. Net result: a reader following this topology's strategic recommendations would over-invest in defensive moves (S5 dose-limit, S6 effortful practice, S2 identity diversification as defense) and under-invest in offensive moves (using AI to attempt projects previously out of reach, building genuinely new capabilities, leveraging novice-skill compression to enter new domains). The frame is "navigate the threats" rather than "exploit the asymmetry between current capability and current price" — and the asymmetry between current capability and current price may be the largest individual-leverage opportunity in the entire AI transition, larger than any of the harms the topology catalogues.

**Steelman.** This is partially correct and worth integrating. Three specific points survive scrutiny: (a) the topology's relationship-domain framing genuinely is calibrated to companion-app pathology more than to the broader landscape of AI in relationships (assistants used in healthy ways within human relationships, AI as scaffolding for hard conversations, therapy-bot benefit at clinical scale) — the strategic recommendation S5 (dose-limit) is correct as stated for high-engagement companion use, but it does not generalize to "use less AI for emotional things" as a blanket rule. (b) On meaning, the topology under-develops the *opportunity* side of telic compression: when AI absorbs the rote bottom of telic work, the ambitious top of telic work becomes accessible to far more people. This is the "novice can now ship a real product / write a real book / publish real research" effect, and it is not currently encoded in the topology except as a productivity gain (E1, E2). (c) The strategic recommendations (S2 identity diversification, S3 atelic ballast, S4 in-person relationships) implicitly require social and structural conditions — community, time affluence, intact local relational infrastructure — that are *themselves* contingent and unevenly distributed. L2 (material-floor primacy) names economic security as a precondition; the topology does not name *atelic-infrastructure* primacy as a parallel precondition. For someone whose neighborhood has no third places, no inherited community, and no time-affluent peers, "build atelic ballast" is not just unhelpful advice — it is advice that presupposes resources the disrupted populations don't have. This is the symmetric counterpart to D4 (material-blind class bias) but for atelic infrastructure rather than economic floor.

**The version that survives.** Three structural points hold even after the steelman is fully accepted:

1. *The dose-response in E11 (OpenAI-MIT) is real, prospective, and not predicted by the upside framing.* Heavy daily use predicts loneliness, dependence, and less in-person socialization regardless of modality. This is not "framing" — it is a well-powered RCT showing a specific harm pattern at high dose. The strategic recommendation S5 (dose-limit) survives as evidence-based even if the broader relationship-domain picture is uncertain.
2. *The cross-domain bridge (G4) survives.* Even if A3 turns out closer to calculator-analogue, the bridge structure holds at lower magnitude — practice atrophy across analytical, emotional, and meaning-architecture domains is the same mechanism, and S6 (effortful practice) is cheap insurance regardless of whether the magnitude is large or small.
3. *The Waldinger and Surgeon-General-baseline findings (E15, E14) are not contestable in the upside-framing way.* Whatever AI does to relationships, the baseline that *human relationships predict health outcomes more than cholesterol* is durable. The topology's strategic recommendation S4 (embodied in-person relationships as non-negotiable) follows from this baseline, not from the companion-app harm cluster.

**What changes if the objection is fully accepted:** the topology should add an explicit "AI as exploitable asymmetry between capability and price" track in Stage 3 — a model of when AI-assisted ambitious-telic projects are net-positive vs. net-deskilling. Currently this is implicit in S7 (career bet) but deserves its own formalization target in a future stage. Adding it as an explicit handoff to Stage 3.

---

## Open questions

The four open questions are the falsification windows for the assumptions:

| Question | Crux it gates | Time-to-resolution (estimated) |
|---|---|---|
| O1 — AGI by 2028? | A1 | 1–3 years |
| O2 — Asymmetric-adoption couples | E11 generalization | 2–4 years |
| O3 — Calculator-analogue or cumulative atrophy? | A3 | 2–5 years |
| O4 — AI-augmented atelic activities — less meaningful? | A5 / S3 | 3–7 years |

The time-to-resolution column is my own estimate based on what kind of study would have to be designed and run to produce a defensible answer; treat as rough ordering, not calibrated forecasts. O1 has the most variance in resolution time and the largest discourse footprint; O3 has the cleanest empirical operationalization; O2 is the lowest-hanging fruit for an empirical study; O4 is the hardest to operationalize but would have the deepest impact if it resolves yes.

---

## Next moves — preparing for model formalization (Stage 3)

The topology hands off **one integrated formalization target** to Stage 3, with two simpler alternatives if the integrated version proves untractable.

**Recommended target — Net meaning-budget under AI absorption: ΔNet = ΔV − ΔM.** Stage 3 should produce a single dashboard that quantifies both halves of the strategic question — defensive (meaning-loss from telic absorption) and offensive (value-gain from ambitious-telic access) — so the user can see the net across configurations of their own life. Variables:

- *Defensive side (ΔM, meaning loss):* telic share of identity (T), atelic ballast (B), AI-absorbable fraction of telic work (φ), competence-frustration sensitivity (κ). Form: ΔM = −κ·φ·(T − B). Boundary: B ≥ T → ΔM ≈ 0 (atelic-ballast hypothesis holds); T ≫ B → catastrophic ΔM (foreclosure path).
- *Offensive side (ΔV, value gained):* pre-attempt skill stock (s), AI capability on the task (a), feedback-loop richness (f), retained effortful practice (ρ). Form: ΔV = α·a·(1 − s) when f and ρ are above thresholds (novice-skill compression + access expansion); ΔV ≤ 0 when f or ρ falls below thresholds (self-automator trap E7).
- *Net:* ΔNet = ΔV − ΔM, with explicit interaction terms — high B raises both ΔV (more identity surface that survives) and ΔM-resistance.

This integrated form addresses both the original defensive crux-of-cruxes question (A2 + A3 + A5 all engaged) and the upside-side critique surfaced in the Adversarial + steelman section. It gives Stage 4 (data) clean targets — fit κ from existing time-use + meaning surveys; fit f and ρ from the BCG self-automators study; calibrate α from productivity-gain literature (E1, E2). It gives Stage 5 (build) a direct path to a shippable interactive dashboard where users tune their own (T, B, φ, κ, s, a, f, ρ) and see ΔNet.

**Fallback alternatives if the integrated form proves untractable:**

- *Defensive-only:* ship just the ΔM model. Cleanest derivation; engages A2 / A3 / A5 directly; carries forward the adversarial-section asymmetry but is still genuinely useful.
- *Dose-response curve:* fit OpenAI-MIT + Therabot + identity-discontinuity into a continuous curve mapping daily AI-emotional-engagement minutes to outcomes, conditional on baseline relational thickness. Empirically testable, narrowest scope, addresses E11 / S5 specifically. Useful as an embedded module inside the integrated model rather than as the standalone target.

**Recommendation: pursue the integrated ΔV − ΔM target.** If the interaction terms prove too speculative to ground in existing data, fall back to the defensive-only ΔM model and treat ΔV as a future Stage-3 expansion.

---

## Glossary

- **A1–A5** — the five foundational assumptions; see Cruxes section.
- **Apprenticeship ladder break (G5)** — AI absorbs entry-rung tasks → no rung-1 → expert pipeline collapses. Distinct from full-occupation substitution.
- **Atelic / Telic (Setiya)** — atelic activities are realized in the doing (durative — friendship, contemplation); telic are aimed at completion (self-annihilating — finishing a project).
- **Bridge edge** — a cross-domain dependency; the same mechanism produces effects in two or more of work / relationships / meaning. Visualized as blue dashed lines in the graph.
- **Cognitive offloading (G4)** — delegating an effortful cognitive task to AI; mechanism for deskilling via practice atrophy. The structural bridge of this topology.
- **Competence frustration (G10)** — Self-Determination-Theory mechanism whereby AI threatens the felt sense that one's skill makes the difference, both symbolically and actually.
- **Crux** — a foundational-assumption node whose falsification forces rebuilding regions of the topology. The five A nodes are this topology's cruxes.
- **Decision-leverage** — informal measure of how directly an individual can act on a node; high / medium / low. Used in the leverage variant.
- **Distortion (D)** — a selective reading that ignores parts of the evidence base in service of a prior commitment. Not "wrong" — motivated.
- **Engagement-optimized substitution (G3)** — companion-app commercial incentives select for prolonging the loneliness they purport to alleviate (Muldoon-Park 2025).
- **Finding / Forecast / Interpretation (L1)** — the three evidence-weight categories; ignoring the distinction is the most common analytical error in AI-transition discourse.
- **Foreclosure (Marcia)** — adopting a single-strand identity without exploration; predicts catastrophic disruption response when that identity is destabilized.
- **Jagged frontier (Dell'Acqua-Mollick)** — AI capability is uneven across superficially similar tasks; large gains inside the frontier, degraded performance outside.
- **Material-floor primacy (L2)** — for anyone whose material floor is insecure, labor disruption IS the existential question. Overrides relational/meaning framing for that population.
- **Relational depletion (A4)** — the 60-year decline in face-to-face sociality (Putnam → Thompson Anti-Social Century) that AI lands into.
- **Self-automator (Randazzo et al.)** — third class beyond centaur/cyborg; delegates both *what* and *how* to AI; 27% of consultants in BCG study; no skill development in either domain.
- **Setiya midlife framework** — telic activities self-annihilate on completion; atelic activities are realized in the doing; the most useful philosophical instrument in this lit review for strategic life decisions under AI.
- **Working Alliance Inventory (WAI)** — measure of patient–therapist therapeutic alliance; Therabot RCT scored 3.59, comparable to outpatient norms.

---

## Model
*topic: navigating-ai-world · stage: model · pass 8 · complete*

Generating function for the AI transition as life restructuring. ΔNet = ΔV − ΔM, partitioned across work, relationships, and meaning. Closed-form pieces: telic-absorption defensive term, competence-erosion bridge through ρ, gated novice-skill compression, dose-response in the relational channel, exponential atrophy of retained practice. Interactive dashboard with scenario presets and time trajectory.

## TLDR

This stage formalises the topology's recommended target: a generating function for **net life-outcome change under AI transition**, partitioned into a defensive side (meaning lost when AI absorbs work that produced it) and an offensive side (value gained when AI compresses the price of ambitious projects). The spine is **ΔNet = ΔV + ΔM** (with ΔM ≤ 0 by construction). Each side decomposes into three additive channels — for ΔM, telic absorption + competence erosion + relational dose-response; for ΔV, gated novice-skill compression + therapeutic-grade relational benefit + a self-automator-trap penalty when the gate closes. The single conceptual move that makes the pieces fit is encoding the topology's cross-domain bridge G4 (cognitive offloading via practice atrophy) as one global parameter — **ρ, retained effortful practice** — that enters the competence-erosion channel directly and the self-automator gate g(f, ρ) that governs both productivity gain and trap penalty. Three of the six channels move with ρ; the other three (telic absorption, therapeutic-grade relational benefit, relational dose-response) are independent of it. ρ is what makes the bridge non-decorative: lowering it degrades the analytical-work and meaning-architecture channels simultaneously while leaving identity-allocation (T, B) and relational-dose (d, δ_R) decisions with their own independent leverage.

The dashboard exposes ten levers (T telic share, B atelic ballast, φ AI-absorbable fraction, κ competence-frustration sensitivity, s pre-attempt skill, a AI capability, f feedback-loop richness, ρ retained practice, d daily AI-emotional minutes, δ_R relational baseline thickness) and reports ΔV, ΔM, ΔNet, the channel-level decomposition, and three structural flags — below the self-automator gate, above the relational dose-safe threshold, telic share exceeding atelic ballast. Six scenario presets anchor recognisable positions: the default risk path (telic-heavy knowledge worker), the same person after the S3 atelic-ballast intervention, the Randazzo self-automator trap, the asymmetric exploiter (novice using AI to attempt previously-out-of-reach projects), the creative-displacement-anxiety vector, and the heavy companion-app user. The trajectory tab makes the A3 crux (O3 in the topology) directly parametric: under λ = 0 (calculator analogue), ρ stays put; under λ > 0 (cumulative atrophy), ρ decays exponentially in the offloading-rate × time product and ΔNet drifts down with it as the gate closes.

What's ready for formalisation: the topology's two mechanistic reframers (G4 offloading, G7 telic exhaustion) admit clean closed-form representations; G3 (engagement-optimized substitution) is encoded only as a parameter source for ψ_R rather than as a represented mechanism, so re-calibrating ψ_R covers G3-shifts but the model's structure does not change. The OpenAI-MIT dose-response is well-enough characterised to support a piecewise-linear surface; Brynjolfsson-Li-Raymond and the BCG RCT give the productivity-scale anchor for α and the rough location of the self-automator gate. What's still observational: the empirical magnitude of λ (the cumulative-atrophy speed — the single largest unknown in the model and the load-bearing parameter for whether the picture leans defensive or offensive over a five-year horizon); the exact shape of the relational dose-response (linear above d_safe is a placeholder; the curve may be sigmoidal or have a second inflection at very high doses); the calibration of κ across populations; and whether T and B can be treated as scalar identity-shares or require a multi-dimensional decomposition. Stage 4 (data) inherits these as the named targets to fit.

The model is genuinely useful in two ways. First, it makes the structural argument visible: the three defensive channels collapse to a single hidden variable ρ, and the strategic recommendations from the topology (S3 ballast, S4 in-person relationships, S5 dose-limit, S6 effortful practice) each map to one or two specific levers — pulling them is not "AI hygiene" but specific countermoves against specific channels. Second, it makes the upside path legible: most public discourse on AI-and-life is defensive in flavour, and the model shows that under conditions of high feedback-loop richness, maintained practice, and a low pre-attempt skill stock, ΔV is large and ΔNet is positive — the asymmetric-exploiter regime. The model is not a forecast; it is a frame the user fills in with their own (T, B, φ, κ, s, a, f, ρ, d, δ_R), and the value of the frame is that pulling any single slider has visible structural consequences across all three domains rather than just the domain it nominally belongs to.

<AITransitionModel client:load />

## How to read this stage

The dashboard above is not a prediction engine. ΔV, ΔM, and ΔNet are normalised changes in expected life-outcome share over a fixed horizon — units are arbitrary; what matters is the sign, the relative magnitude across channels, and how the numbers move when you pull a lever. If two configurations of the sliders give ΔNet = +0.10 and ΔNet = −0.20, the model is claiming the first life-arrangement is structurally better-positioned than the second under the model's assumptions. It is not claiming +10% versus −20% of anything literal.

Three reading rules:

1. **The signs and the channel-level decomposition carry the most signal.** A positive ΔNet that is small but balanced across channels reads very differently from a positive ΔNet of the same magnitude where ΔV_prod is huge and ΔM_comp is large-and-negative. The first is robust; the second is a self-automator trap waiting to spring.
2. **Pull one slider at a time and watch what moves.** The point of the bridge encoding through ρ is that ρ enters multiple terms — moving it shows which terms it touches. Moving T (telic share) at fixed B (atelic ballast) shows how the ΔM_telic channel responds to identity allocation. The structural flags below the channel chart fire when the configuration crosses a meaningful threshold; they are the model's way of saying "you have entered a regime where one of the topology's recommendations is load-bearing for you specifically."
3. **The presets are not personality types.** Each preset is a recognisable position from the lit review — the Randazzo self-automator (BCG 27%), the OpenAI-MIT high-dose companion-app user, the Caporusso creatively-displaced practitioner, the Brynjolfsson-Li-Raymond novice-skill-compression archetype (which is what the asymmetric-exploiter preset instantiates). Use them to calibrate your intuition for what each region of the parameter space looks like, then move the sliders toward your own position.

The rest of the document spells out what each parameter means, what closed-form pieces compose the dashboard, where the model breaks, and the four objections it has to defend against.

## 1. What's being formalised

This stage performs three explicit moves:

- **Decomposition.** Net life-outcome change under the AI transition is partitioned into ΔV (offensive — value gained) and ΔM (defensive — meaning lost), each further decomposed into three additive channels. The decomposition is orthogonal at the channel level (each channel is its own sum) but block-coupled through the shared bridge parameter ρ.
- **Generating function.** Each channel is written as an explicit function of the input parameters, with closed-form expressions where the topology supports them (the gate g(f, ρ), the AM-style telic-absorption term, the exponential ρ(t) trajectory) and a piecewise-linear placeholder where it does not (relational dose-response above d_safe).
- **Integration.** The three lit-review domains (work, relationships, meaning) and the three reframer mechanisms from the topology (G3, G4, G7) are unified under a single equation rather than treated as parallel stories. The integration is what gives the strategic recommendations their leverage: S3 ballast moves B, S5 dose-limit moves d, S6 effortful practice moves ρ — and because ρ is shared across channels, S6 is structurally the highest-leverage move even though it looks like a workplace-productivity tip.

What is **ready for formalisation**:

| Piece | Source | Form |
|---|---|---|
| Productivity scale α | Brynjolfsson-Li-Raymond, BCG RCT | α ≈ 0.40, gated by f·ρ |
| Self-automator gate τ | Randazzo BCG self-automator share + skill-development null | τ ≈ 0.30, σ ≈ 0.06 |
| Telic-absorption ΔM (G7) | Setiya telic/atelic + topology A5 | −κ·φ·max(0, T−B) |
| Competence-erosion ΔM (G4) | SDT competence-frustration + topology G10 | −λ_M·(1−ρ)·T |
| Relational dose-response | OpenAI-MIT N=981 | piecewise: protective ≤ d_safe, harm > d_safe |
| Time evolution of ρ | A3 crux made parametric | ρ(t) = ρ₀·exp(−λ·u·t) |

Two of the topology's reframer mechanisms (G4 cognitive offloading bridge, G7 telic exhaustion) admit mechanistic closed-form representations as listed above. The third reframer (G3 engagement-optimized substitution) is *not* represented mechanistically — it is a *parameter source* for ψ_R: G3 informs the calibration of the dose-response slope above d_safe, but the model has no machinery for the engagement-optimization mechanism itself. If G3 weakens (regulation, consumer demand, competitor design), ψ_R would re-calibrate but the model's structure would not change. This is a real scope difference: G4 / G7 are encoded in the model; G3 is encoded only in the calibration of one parameter.

What is **still observational** and inherited by Stage 4 as fitting targets:

- λ (cumulative-atrophy speed) — the single largest unknown. The topology's open question O3 is exactly "is λ ≈ 0 or λ ≈ 0.06?" The 2+ year longitudinal study that would answer this does not yet exist; Stage 4 will use the cross-sectional evidence (Gerlich, Stadler-Bannert-Sailer, Kosmyna, Ehsan) to bound λ from below but cannot pin it down.
- ψ_R (above-d_safe slope), β_R (below-d_safe slope), and the precise location of d_safe — the OpenAI-MIT data give the qualitative shape but the magnitudes will need refitting from the 300,000-message dataset directly.
- κ (competence-frustration sensitivity) calibration across populations and trait classes. The SDT literature gives within-study coefficients but not a portable scale.
- The scalar-identity assumption on T and B. The topology treats meaning architecture as one thing; in reality identity is multi-domain (work, family, hobbies, civic), and the AI absorption fraction φ may differ across components. Stage 4 should test whether scalar T is a useful approximation or whether a vector T = (T_work, T_family, T_hobby, …) is required.

**Time horizon.** ΔV and ΔM are comparative-statics changes — they are not instantaneous nor lifetime-cumulative. The defensible reading is that the static dashboard's ΔNet represents change over a "medium" horizon, ~1–5 years, where the fast channels (ΔV_prod from current AI capability) and slow channels (ΔM_telic, ΔM_comp from prolonged absorption and offloading) meaningfully overlap. Shorter than ~1 year, the productivity-gain side over-weights because the meaning-erosion channels haven't had time to materialise. Longer than ~5 years, the trajectory tab — which evolves ρ explicitly — is the better lens because the static parameters drift. The horizon is implicit because the literature does not yet give a single time-scale for the slow channels (the cross-sectional evidence is duration-vague). Stage 4 should test whether different empirical anchors imply different horizons and, if so, build a time-resolved version of the dashboard rather than a static one.

## 2. The generating function

### 2.1 Spine

For an individual, over a fixed horizon Δt:

```
ΔNet = ΔV + ΔM
ΔM ≤ 0 by construction
```

ΔV is offensive — the value gained from AI augmenting projects, accessing previously-out-of-reach domains, and (in the relational channel) the therapeutic-grade benefit at low dose. ΔM is defensive — the meaning lost when AI absorbs work that was carrying meaning, when retained practice atrophies and competence erodes, and when high-dose AI-emotional engagement substitutes for in-person relationships in a thin baseline.

**A note on the cut.** The V/M split is an organising convention for presentation, not a structural claim about the model. The substantive structure is the six-channel additive decomposition; "ΔV" and "ΔM" group channels by typical sign for ease of reading. ΔV_trap in particular is bounded above by 0 (it is always negative-or-zero) and could equivalently be moved to the ΔM side without changing any of the model's claims. The block-coupling through ρ runs across the V/M cut: ΔM_comp (defensive side) and ΔV_prod / ΔV_trap (offensive side) all move together when ρ moves. If you find the V/M framing more confusing than helpful, read the channels directly and ignore the cut.

The spine is intentionally minimal. The work happens in the channel-level decomposition.

### 2.2 The three defensive channels (ΔM)

```
ΔM_telic = −κ · φ · max(0, T − B)
ΔM_comp  = −λ_M · (1 − ρ) · T
ΔM_rel   = −ψ_R · max(0, d − d_safe) · (1 − δ_R)
ΔM       = ΔM_telic + ΔM_comp + ΔM_rel
```

**ΔM_telic — telic absorption.** The Setiya / topology A5 channel. Fraction φ of telic work that AI absorbs, weighted by competence-frustration sensitivity κ, conditional on the telic share T exceeding the atelic ballast B. When B ≥ T, the meaning that was staked on telic completion has somewhere else to live; when T ≫ B, telic absorption is structurally costly. The max(0, ·) is the topology handoff's "B ≥ T → ΔM ≈ 0" boundary made explicit.

**ΔM_comp — competence erosion (the bridge).** The G4 / G10 channel. Practice atrophy degrades competence not just symbolically (AI can do it) but actually (you cannot do it any more). The (1 − ρ) factor is the inverse of retained practice; the T multiplier is because competence erosion only hurts to the extent the lost competence carried meaning. This is the channel that makes ρ the bridge: a single parameter pulls both the offensive gate (below) and the defensive ΔM_comp term.

**ΔM_rel — relational dose-response.** The G3 / E11 / E12 channel. The OpenAI-MIT N=981 RCT found loneliness, dependence, and reduced in-person socialization at high daily voluntary use; below a threshold (placed at d_safe in the model), the same engagement is protective rather than harmful. Above d_safe, the penalty scales linearly in the excess dose, dampened by the relational baseline thickness δ_R — someone embedded in thick local relational infrastructure absorbs the dose better than someone in the Anti-Social-Century baseline. ψ_R is the per-minute slope; in the dashboard it is calibrated so that 60 minutes above d_safe in a thin baseline produces a meaningful but not catastrophic ΔM_rel.

### 2.3 The three offensive channels (ΔV)

```
g(f, ρ) = 1 / (1 + exp(−(f · ρ − τ) / σ))

ΔV_prod = g · α · a · (1 − s)
ΔV_rel  = β_R · min(d, d_safe) · (1 − δ_R / 2)
ΔV_trap = −(1 − g) · η_trap · a
ΔV      = ΔV_prod + ΔV_rel + ΔV_trap
```

**g(f, ρ) — the self-automator gate.** The smooth analogue of the topology handoff's "above thresholds → upside, below → trap" dichotomy. The product f · ρ is the relevant axis: feedback-loop richness times retained practice. Both have to be present for AI use to upskill; if either collapses, the gate closes. τ ≈ 0.40 puts the threshold roughly where Randazzo's self-automator boundary lies in the BCG data; σ ≈ 0.06 makes the transition smooth enough that small parameter changes don't flip the regime discontinuously.

**ΔV_prod — productivity / novice-skill compression.** Brynjolfsson-Li-Raymond's customer-service finding (+14% overall, +34% novice) plus the BCG RCT inside-frontier gains. The (1 − s) factor is novice-skill compression: experts gain little from AI on tasks they're already good at, novices gain the most. α is the productivity scale; in the dashboard, α = 0.40 reproduces Brynjolfsson novice gain at s ≈ 0.2 (α · 0.8 ≈ 0.32, in the 30–34% range). Multiplied by a (AI capability on the task) — when capability is low or task is outside the jagged frontier, ΔV_prod shrinks regardless of s.

**ΔV_rel — therapeutic-grade relational benefit.** Therabot RCT (Heinz et al. 2025) and the De Freitas equivalence-hypothesis cluster — at low dose, AI emotional engagement produces real benefit. Capped at d_safe so the channel does not double-count the dose-response harm; the (1 − δ_R/2) factor flattens the benefit for people in already-thick relational baselines (they get less marginal benefit from low-dose AI use because they have alternatives).

**ΔV_trap — self-automator penalty.** When the gate closes (g → 0), AI use is not just unhelpful but actively degrading: the user delegates without the practice and feedback loops that would catch errors, and incurs both the productivity gain they fail to capture and the deskilling cost. η_trap is the penalty scale; calibrated so that f = ρ = 0.2 (well below threshold) and a = 0.8 produces a moderately negative ΔV_prod + ΔV_trap sum even before adding ΔM.

### 2.4 The bridge through ρ

The cross-domain bridge G4 from the topology is encoded as one fact: ρ enters three of the six channels — one defensive directly, two offensive via the gate.

```
ΔM_comp = −λ_M · (1 − ρ) · T          ← direct: ρ low → competence erosion
g       = sigmoid((f · ρ − τ) / σ)    ← intermediate: ρ low → gate closes
ΔV_prod = g · α · a · (1 − s)         ← gated: lower g → less productivity gain
ΔV_trap = −(1 − g) · η_trap · a       ← gated: lower g → larger trap penalty
```

The remaining three channels (ΔM_telic, ΔM_rel, ΔV_rel) do not depend on ρ. ΔM_telic depends on (T, B, φ, κ) — the identity-allocation and AI-exposure parameters. ΔM_rel and ΔV_rel depend on (d, δ_R) — the daily-relational-dose and baseline-thickness parameters. Lowering ρ has compounding effects across the three ρ-coupled channels: it directly increases ΔM_comp, and it lowers g which simultaneously reduces ΔV_prod and increases ΔV_trap. This is the mathematical content of the topology's claim that S6 (maintain effortful practice) is load-bearing — it is a single-parameter intervention with leverage in three of the six channels.

The bridge is also why the channel-level decomposition is *not* fully orthogonal even though each channel is written as an additive term. The three ρ-coupled channels are **block-coupled through ρ**: at fixed ρ, they are independent; varying ρ moves them in coordinated ways. This is the model's representation of the lit review's structural finding that the work-deskilling and meaning-competence channels share a generating function (G4) — they are not separate transitions but one transition with two projections. The topology's broader claim that *all three* domains (work, relationships, meaning) share a generating function is a stronger version that the model represents weakly: ρ couples the work and meaning channels directly, while the relational channel is coupled only via the topology's parallel-mechanism inference (which the model leaves unencoded; see Objection 2 in §7 for why).

### 2.5 Time evolution

The trajectory tab makes A3 (cumulative offloading) parametric:

```
ρ(t) = ρ₀ · exp(−λ · u · t)
ΔNet(t) = compute with ρ(t); all other parameters held fixed
```

Where λ is the atrophy speed and u is the offloading rate. λ = 0 is the calculator-analogue: using AI is structurally like using a calculator and retained practice does not decay. λ > 0 is the cumulative-atrophy regime: ρ falls exponentially in the offloading-rate × time product. Only ρ evolves; pre-attempt skill stock s, AI capability a, feedback richness f, and the identity / relational parameters are held fixed across the trajectory. The deskilling-over-time effect is captured *through* ρ, not as a separate s(t) decay: as ρ drops, the gate g(f, ρ) closes, which propagates to ΔV_prod (smaller productivity gain) and ΔV_trap (larger penalty), while ΔM_comp grows directly. Adding a second decaying variable would either double-count the deskilling or imply the user becomes a fresh novice with full upside available — neither of which matches the lit-review evidence.

The empirical question O3 from the topology is which regime is correct for AI-augmented knowledge work. The dashboard makes the answer visibly consequential: under λ = 0 (calculator analogue), a knowledge worker with ρ₀ = 0.8 has flat ΔNet over 10 years; under λ = 0.06 (heavy cumulative regime, ρ-half-life ≈ 19 years), the same person has ρ(10) ≈ 0.56, the gate closes from g ≈ 0.84 to g ≈ 0.42, and ΔNet drifts measurably more negative as ΔV_prod compresses and ΔV_trap grows — the drift magnitude over 10 years is on the order of −0.15 ΔNet under default identity parameters (T = 0.7, B = 0.2). Meaningful, but not catastrophic; cumulative-atrophy is a slow process even in its strong regime, and the dramatic-collapse intuition some readers bring to "AI rots your brain" framings does not map onto the model's actual trajectory at calibrated λ. The model cannot answer which regime is right; it can only make the difference legible.

## 3. Closed-form pieces

Three components admit clean equations. The rest are calibrated approximations.

### 3.1 The gate g(f, ρ)

```
g(f, ρ) = 1 / (1 + exp(−(f · ρ − τ) / σ))
```

Standard logistic with location τ and scale σ. The product f · ρ on the location axis is the substantive choice: feedback-loop richness alone is not enough (you can have great feedback loops but no practice if you've fully delegated to AI), and retained practice alone is not enough (you can grind without ever testing your work against reality). Both must be present.

Worked anchors at τ = 0.30:
- f = 0.7, ρ = 0.7 (cyborg / centaur centroid in BCG): f·ρ = 0.49 → g ≈ 0.96. Strong upside regime.
- f = 0.5, ρ = 0.5 (median knowledge worker): f·ρ = 0.25 → g ≈ 0.30. Just below the inflection — small interventions push them above; small drift pushes them below.
- f = 0.2, ρ = 0.3 (full self-automator): f·ρ = 0.06 → g ≈ 0.02. Trap effectively closed.

The threshold τ = 0.30 is a structural-claim choice, not a tight fit. The substantive claim is "the median knowledge worker without conscious intervention is *on the edge* — small movements in f or ρ push them either way" — which the topology supports but no single dataset cleanly calibrates. The BCG-Randazzo 27% self-automator share gives a rough lower-bound anchor (someone with f ≈ ρ ≈ 0.2–0.3 should be in the trap), and the centaur upper-tail (f ≈ ρ ≈ 0.7–0.8) gives a rough upper-bound anchor (clearly above the gate) — but the precise location of τ between these points is a modelling choice. Pass 1 used τ = 0.40 and produced g ≈ 0.07 at the median, which encoded a stronger structural claim than the data supports ("most knowledge workers are deeply trapped"); pass 2 lowered to τ = 0.30 to encode the weaker, better-supported claim ("the median is on the edge"). Stage 4 should test τ directly against the BCG individual-level data; whether the value generalises beyond consulting work is a separate question that requires non-BCG samples to answer. σ = 0.06 makes the transition smooth (10–90% transition over an f·ρ range of about 0.27, or roughly two standard deviations of within-population variation in either parameter alone).

### 3.2 Telic-absorption ΔM_telic

```
ΔM_telic = −κ · φ · max(0, T − B)
```

Three multiplicative factors. **φ ∈ [0, 1]** is the AI-absorbable fraction of telic work — for software engineering φ might be 0.6 (much of the implementation absorbed but design judgment retained); for in-person therapy φ might be 0.1. **(T − B) ∈ [−1, 1]** is the unballasted telic share — how much of identity is staked on telic completion that does not have an atelic counterpart. **max(0, ·)** truncates the negative branch: if B exceeds T, atelic ballast covers the whole telic identity-share and then some, so absorbing telic work does not pull meaning out of unfilled space. **κ ∈ [0, 1]** is competence-frustration sensitivity — the SDT-derived scalar that translates competence shortfall into amotivation. Some people are temperamentally more sensitive to this; the model exposes κ as a slider rather than a constant.

The boundary B = T is the topology's atelic-ballast hypothesis (S3) made structurally precise. It is **not** that ballast eliminates the productivity threat; it is that ballast moves the meaning-architecture out of the line of fire. ΔV_prod is unchanged by raising B; only ΔM_telic shrinks. This is what S3 actually buys.

### 3.3 Competence erosion ΔM_comp (the bridge term)

```
ΔM_comp = −λ_M · (1 − ρ) · T
```

The (1 − ρ) factor is offloading: the fraction of capacity that has slipped because practice was delegated. The T factor is because competence erosion only matters to the extent the eroded competence was carrying identity. λ_M is the coefficient — calibrated so that ρ = 0, T = 1 produces ΔM_comp = −0.30 (a severe SDT-style competence-frustration hit, the upper end of what BPNSFS scales operationalise).

Note the asymmetry with ΔM_telic. Building atelic ballast (raising B) is **no defense against ΔM_comp** — the competence erosion is happening in the telic domain regardless of where else identity is anchored. The only defense against ΔM_comp is keeping ρ high (S6 effortful practice), which is exactly the topology's load-bearing recommendation. The model makes this asymmetry visible: pulling B up zeroes out ΔM_telic but leaves ΔM_comp untouched; pulling ρ up shrinks both.

### 3.4 Relational dose-response

```
ΔM_rel = −ψ_R · max(0, d − d_safe) · (1 − δ_R)
ΔV_rel = +β_R · min(d, d_safe) · (1 − δ_R / 2)
```

Piecewise-linear, with the kink at d_safe = 30 minutes/day. Below d_safe, the dose carries therapeutic-grade benefit (Therabot scale, scaled by β_R = 0.001). Above d_safe, every additional minute carries a dose-response penalty (ψ_R = 0.003 per minute per (1 − δ_R)). At the maximum below-threshold dose (d = d_safe = 30) in a fully thin baseline (δ_R = 0), peak benefit is β_R · 30 · 1 = 0.030 — comparable to ΔV_prod at moderate productivity gain. Pass 1 used β_R = 0.004 which made peak benefit ≈ 0.12, dominating ΔV_prod even in well-functioning AI-use scenarios; that overstated the generalisation from Therabot's clinical-population effect to the median user.

Two things this form does not capture and which Stage 4 should refit:
- **Possible sigmoidal saturation at very high doses.** ψ_R may fall once d gets large enough — at some point further engagement does not produce *additional* harm because the relational substitution is already complete. The OpenAI-MIT data is too narrow in dose range to show this.
- **The crossover may be nonlinear.** d_safe is a sharp kink in the model; in the data it is plausibly a smooth U-shape. The kink is a useful caricature for the dashboard but is not a structural claim about the curve.

The (1 − δ_R) and (1 − δ_R/2) factors encode the topology's A4 (relational depletion is structural) crux: in a thin relational baseline, both the upside and (especially) the downside are amplified. In a thick baseline, both attenuate. The asymmetric attenuation (penalty fully scaled by δ_R, benefit only half-scaled) is the model's claim that thick baselines mostly *protect* — they don't crowd out low-dose benefit but they do dampen high-dose harm.

### 3.5 Productivity gain ΔV_prod

```
ΔV_prod = g(f, ρ) · α · a · (1 − s)
```

Already covered above. Worth flagging that the (1 − s) novice-skill-compression factor is the offensive-side analogue of the (T − B) ballast structure on the defensive side: just as ballast determines whether telic absorption is costly, pre-attempt skill stock determines whether AI augmentation is large or marginal. **The asymmetric-exploiter scenario in the dashboard is exactly the configuration where (1 − s) is large, f and ρ are above the gate, and the upside dominates** — a novice with the discipline to maintain feedback loops and effortful practice is structurally better-positioned than an expert who delegates without either.

**Cross-domain α heterogeneity.** The model's α = 0.40 is a midpoint compromise across studies that span ~6× in measured productivity gain (α ≈ 0.24 to 1.4). Translating empirical "+X% productivity" into α uses ΔV_prod = α·a·(1−s) at gate-open: α = ΔV_prod / (1−s) with assumed average s for the study population.

- **Brynjolfsson-Li-Raymond customer service** is a two-anchor study. +14% overall productivity at average s≈0.5 → α ≈ **0.28**. +34% novice (s≈0.2) → α ≈ **0.43**. The within-study spread is 1.5× — already non-trivial — and the model's α=0.40 picks up the novice-end of this range.
- **BCG consulting** (Dell'Acqua-McFowland-Mollick) is two-outcome: +12% productivity inside the frontier (s≈0.5) → α ≈ **0.24**; +40% quality on the same tasks → α ≈ **0.80**. Productivity-vs-quality spread within a single study is ~3×, the largest within-study spread on record.
- **Cui-Demirer-Jaffe-Musolff-Peng-Salz** Copilot/coding field experiments found ~26% increase in completed-task throughput per week. With developer s ≈ 0.4–0.5: α ≈ **0.5–0.65**.
- **Peng-Kalliamvakou-Cihon-Demirer** GitHub Copilot controlled experiment found 55% faster task completion on a JavaScript HTTP-server task. With s ≈ 0.4–0.5: α ≈ **1.0–1.4**.

The honest reading: α spans ~0.24 to ~1.4 across measured studies — a 5–6× range, not the 4× I previously claimed. The model's α = 0.40 picks up roughly the median of this range and is closest to the Brynjolfsson-novice anchor. Users in highly AI-suited domains (coding, writing, image generation, data analysis) should mentally bump α up by 1.5–3×; users in BCG-productivity-class tasks should bump it down by ~1.5×; users in domains where AI is awkward (in-person therapy, embodied physical work, deep relational work) should bump it down further still. Stage 4 should fit α per major task category rather than treating it as a scalar — see Q6 in §9.

### 3.6 ρ(t) trajectory

```
ρ(t) = ρ₀ · exp(−λ · u · t)
```

Standard exponential decay. λ is the per-unit-offloading atrophy rate; u is the offloading rate. Their product is the effective atrophy speed. Half-life = ln(2) / (λ · u) — under λ = 0.06 and u = 0.6 (heavy offloading, cumulative regime), half-life ≈ 19 years; under λ = 0.04, half-life ≈ 29 years; under λ = 0.02, half-life ≈ 58 years. Even in the cumulative-atrophy regime, the timescale is multi-decadal, which is consistent with intuition rust being a slow process.

Only ρ evolves in the trajectory tab; pre-attempt skill stock s, AI capability a, feedback richness f, and the identity / relational parameters are held fixed across the horizon. This is a deliberate simplification (pass 1 had s also decaying, but the asymmetric exponential-vs-linear decay was unprincipled, and adding a second decaying variable doubles-counts deskilling-over-time which is already fully captured through ρ → g closing → ΔV_prod giving way to ΔV_trap).

The form is phenomenological — λ is not derived from a deeper biological model. It's a calibration knob whose value is the falsification window for A3. Stage 4 will bound it from below using the existing cross-sectional evidence; pinning it down requires longitudinal data that does not exist yet.

## 4. Composing the parts: scenario anchors

The dashboard's six presets are calibration targets — each should produce a recognisable position from the lit review.

| Preset | T | B | φ | κ | s | a | f | ρ | d | δ_R | Expected output (τ=0.30, β_R=0.001) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Default risk | 0.70 | 0.20 | 0.60 | 0.50 | 0.60 | 0.70 | 0.50 | 0.60 | 15 | 0.40 | g ≈ 0.50 (on the threshold); ΔV slightly negative (trap and productivity roughly cancel); ΔM dominated by telic absorption + competence erosion; ΔNet ≈ −0.27 |
| Ballast intervention | 0.50 | 0.50 | 0.60 | 0.50 | 0.60 | 0.70 | 0.50 | 0.60 | 15 | 0.40 | Same g, same ΔV; ΔM_telic → 0 (B = T); ΔM_comp unchanged; ΔNet ≈ −0.10 (≈ +0.17 improvement over default risk) |
| Self-automator | 0.60 | 0.30 | 0.70 | 0.40 | 0.40 | 0.80 | 0.20 | 0.30 | 10 | 0.40 | g ≈ 0.02 (gate effectively closed); ΔV_prod ≈ 0; ΔV_trap ≈ −0.24; ΔNet ≈ −0.43 |
| Asymmetric exploiter | 0.40 | 0.40 | 0.40 | 0.40 | 0.20 | 0.80 | 0.70 | 0.70 | 5 | 0.50 | g ≈ 0.96; large ΔV_prod from (1−s = 0.8); ΔM_telic = 0; modest ΔM_comp; ΔNet ≈ +0.20 |
| Creative pre-AI | 0.80 | 0.10 | 0.80 | 0.70 | 0.70 | 0.80 | 0.40 | 0.50 | 10 | 0.30 | g ≈ 0.16; small ΔV (high s + partial gate, mostly trap penalty); very large ΔM_telic ≈ −0.39 from κ·φ·(T−B) compounding at extremes; displacement-anxiety trajectory; ΔNet ≈ −0.69 |
| Heavy companion | 0.50 | 0.30 | 0.40 | 0.40 | 0.50 | 0.70 | 0.50 | 0.60 | 90 | 0.20 | g ≈ 0.50; relational channel dominant — ΔM_rel ≈ −0.14 from 60-min dose excess in thin baseline — but s = 0.50 gives more (1−s) upside than the default-risk preset, partially offsetting ΔV_trap; ΔNet ≈ −0.24 |

Three non-obvious calibration targets:

1. **The default-risk preset should produce ΔNet < 0 driven by the meaning side, not the gate.** Under the new τ = 0.30 the gate sits at g ≈ 0.50 for default risk — ΔV is approximately neutral, and the negative ΔNet comes from ΔM_telic + ΔM_comp. This is the right structural reading: the default knowledge-worker risk is not that AI use is broken (the gate isn't fully closed) but that meaning is leaking from the unballasted telic identity and from competence erosion at moderate ρ.
2. **Ballast intervention should improve ΔNet without changing ΔV.** Raising B alone zeroes ΔM_telic; nothing else changes. The dashboard makes this visible: the ΔV bar is identical between default risk and ballast intervention, only the ΔM_telic channel collapses.
3. **The asymmetric-exploiter preset should beat the ballast intervention by a large margin.** ΔNet ≈ +0.20 vs ≈ −0.10 — the offensive path is not just "different" from the defensive path, it dominates when the conditions are met. The point of the upside framing is to make this comparison legible; a user comparing the two presets should see that S3 (ballast) is a partial mitigation while the asymmetric-exploiter regime is a positive-sum strategy.

## 5. Boundary conditions and where the model breaks

Five boundaries are explicit:

1. **B ≥ T (atelic ballast covers telic identity).** ΔM_telic = 0 by construction. The model still computes ΔM_comp and ΔM_rel; ballast does not protect against competence erosion or relational dose-response. This is the topology's S3 strengthening exactly the channel S3 was designed to address.

2. **f · ρ ≪ τ (gate fully closed).** g ≈ 0; ΔV_prod ≈ 0; ΔV_trap ≈ −η_trap · a. Below the gate, AI use is a net negative on the offensive side regardless of capability a. The model is making the strong claim that there is a regime where more AI capability *increases* damage rather than reducing it. This is not a bug; it is the formalisation of the self-automator finding.

3. **s → 1 (expert at the relevant task).** ΔV_prod → 0. AI augmentation offers little to someone already fully capable. The model is silent on the *meaning* dimension here — an expert may still suffer ΔM_telic and ΔM_comp from AI absorbing their craft even when they get no productivity benefit from it. This is a real and well-attested experience (creative-displacement anxiety in expert practitioners) and the model captures it correctly: ΔV ≈ 0 while ΔM is large and negative.

4. **a → 0 (AI cannot do this task).** Both ΔV_prod and ΔM_telic shrink — there is no absorption when AI cannot do the work. The relational channels are independent of a (they depend on d). This boundary is what the topology calls the "outside the jagged frontier" region.

5. **λ = 0 (calculator-analogue).** ρ(t) = ρ₀ for all t; the trajectory tab collapses to a flat line. This is one tail of the A3 crux. The model under this regime is still useful — the cross-sectional ΔNet is still meaningful — but the time evolution becomes trivial and S6 (effortful practice) loses its strong defensive role (it still maintains the gate and ΔM_comp at t = 0, but does not protect against compounding). The dashboard's "calculator" preset on the trajectory tab makes this regime explicitly testable.

Beyond these, five structural limits are not boundary conditions but scope limits:

- **Individual scope only.** The model takes T, B, φ, etc. as individual parameters. Aggregate dynamics (labor-share macro, civilisational meaning architecture) are out of frame, matching the topology's individual-decision-maker scope.
- **Western-population calibration.** All anchor values come from US/Western studies (BCG, Brynjolfsson, OpenAI-MIT, Therabot, Anti-Social Century, Common Sense). The model's predictions for non-Western contexts should be treated as hypotheses, not findings — particularly for δ_R (relational baseline) which is most sensitive to cultural variation.
- **Single-task framing of φ and a.** The model treats AI capability and AI absorbability as scalars over the user's telic work. Real careers span multiple tasks with very different φ and a. A vector form (φ_i, a_i) is a future extension; the scalar version is sufficient for the structural claims.
- **Labor-market access is exogenous (G5 apprenticeship-break).** The model takes "you have access to the work" as given and computes ΔV_prod from there. But one of the strongest empirically-supported labor findings in the lit review is the apprenticeship-ladder break — entry-level employment in highly AI-exposed occupations fell ~13% (Brynjolfsson-Chandar-Chen ADP data, Aug 2025) for 22-25-year-olds, while same-occupation employment for over-35s rose. For early-career users in highly-exposed occupations, *getting the work in the first place* is the dominant practical concern, and the model is silent on it. ΔV_prod estimates for novices should be read as conditional on labor-market access; if you don't have the access, the model's offensive side does not apply and S7 (career bet) is not the right frame to begin with. This is a structural feature of the labor market, not a parameter the individual can pull.
- **Material-floor primacy (L2).** For users whose material floor is insecure (precarious income, dependent-care obligations, no economic runway), the absence of ΔV_prod — i.e., job loss or income disruption — is the existential question. The model's individual-decision framing implicitly assumes economic security: ΔM_telic versus ΔV_prod is a choice you can have only when survival is not in question. The topology's logical guardrail L2 names this directly, and the model inherits it as a scope limit. For users at the material-secure end of the distribution, the model's strategic recommendations are decision-relevant; for users at the material-insecure end, the labor-economics question dominates and the meaning-architecture machinery here is not the right tool.

## 6. Distortion-aware reading

Each channel of the decomposition has a public-discourse failure mode. The model's job is to make the failure visible.

| Channel | Common misreading | What the model says |
|---|---|---|
| ΔV_prod (positive) | "AI is making everyone more productive" | Conditional on g(f, ρ) being open. Below the gate, the same a and s give negative ΔV via ΔV_trap. Productivity gains are real *but only for users meeting feedback / practice conditions*. |
| ΔM_telic (negative) | "AI is destroying meaning" | Conditional on T > B. The atelic-ballast intervention (S3) zeroes this channel without changing AI exposure. The threat is identity-allocation-dependent, not AI-dependent. |
| ΔM_comp (negative) | "AI use rots your brain" | Conditional on (1 − ρ) being large *and* T being large. People who maintain practice (high ρ) take little hit even at high AI exposure; people whose identity isn't telic-loaded (low T) take little hit even at low ρ. The bridge is real; its blanket framing is wrong. |
| ΔM_rel (negative at high d) | "Companion apps are bad" | Conditional on d > d_safe and δ_R low. Below d_safe, the same engagement is therapeutic-grade benefit (Therabot). The harm is dose-and-baseline-dependent, not modality-dependent. |
| ΔV_rel (positive at low d) | "AI is great for mental health" | Conditional on staying below d_safe. The same engagement above d_safe flips sign. The dose-response framing is the only honest summary. |
| ΔV_trap (negative when below gate) | "Some people just don't know how to use AI" | The trap is structural, not cultural. f and ρ are observable conditions; users meeting them upskill, users not meeting them deskill, regardless of intent or attitude. |

The four topology distortions (D1 fatalism, D2 slow-camp dismissal, D3 productivity-only optimization, D4 material-blind class bias) each map to a specific failure mode of reading the decomposition: D1 ignores ΔV (assumes the gate closes globally); D2 ignores ΔV_prod and ΔM_comp (claims aggregate effects haven't moved); D3 ignores ΔM and ΔV_trap (optimises only ΔV_prod); D4 ignores L2 / the model's silence on material floor.

## 7. Adversarial + steelman

Five objections to the formalisation itself.

### Objection 1 — Variance bookkeeping for meaning is a category error

ΔM is measured on the same axis as ΔV, but meaning is not a quantitative quantity that admits subtraction or comparison across persons. Calling something "−0.30 ΔM" is pseudo-precision: meaning is qualitative, embodied, narratively structured. The decomposition smuggles in a positivism that the underlying phenomenon does not support.

**Steelman.** This is the strongest version of the philosophical-versus-quantitative tension. The lit review's meaning section deliberately uses Setiya, Arendt, and SDT — three traditions, none of which treat meaning as scalar. Reducing them to one number is exactly what those traditions warn against. The objection has force precisely because the model is *not* claiming to measure meaning literally; it is claiming that comparative claims about meaning ("this configuration is better-positioned than that one") admit a structured representation, and the structure is the model's content.

**Response.** Conceded in part. The model is more accurately described as a **comparative-meaning generating function**: given two parameter configurations, it produces a directional claim about which is structurally better-positioned. The cardinal numbers ΔM = −0.20 vs −0.50 should be read ordinally, not metrically. Anyone who reads the dashboard as "you will lose 20% of your meaning" is misreading; the dashboard's job is to surface the channels and their interactions, and the bare numbers are pedagogy for that job. Stage 4 will calibrate the channels against measurable proxies (SDT subscale shifts under intervention, time-use-survey-derived telic/atelic share changes, OpenAI-MIT loneliness-scale movements) without claiming the result is meaning-measurement.

### Objection 2 — Encoding the bridge as a single ρ over-collapses three different mechanisms

Cognitive offloading in analytical work, in emotional processing, and in identity-supporting practice are three different psychological processes with different timescales and different reversibilities. Treating them as one parameter ρ is the topology's bridge made artificially clean — the bridge is *an inference*, not an observed equivalence, and the cross-sectional evidence (Gerlich, Stadler-Bannert-Sailer) is for the analytical channel only.

**Steelman.** This is correct as stated — the topology itself flags that channels 2 (relational) and 3 (meaning) are parallel-mechanism inferences, not directly observed. Encoding them as one parameter is a load-bearing simplification, not a theorem. The simplification has structural consequences: pulling ρ in the dashboard moves all three channels in lockstep, which over-states the actual coupling. A model with three ρs (ρ_analytic, ρ_emotional, ρ_meaning) would let the channels move independently and would more accurately represent the evidence.

**Response.** Conceded with a scope limit. The single-ρ form is the appropriate first formalisation: the topology argues the three channels share a generating function (G4), and a one-parameter representation is the minimum-complexity form consistent with that argument. The dashboard's bridge slider is meant to test the strong form of the argument. Stage 4 should test whether ρ can be split — if O2 (asymmetric-adoption couples) shows that emotional ρ moves independently of analytical ρ, the model needs a vector form. The single-ρ version is falsifiable in exactly that way, which is part of what makes it useful.

### Objection 3 — Linear ΔM_rel above d_safe is not what the OpenAI-MIT curve looks like

OpenAI-MIT N=981 reports a continuous monotone-increasing relationship between daily voluntary use and loneliness, dependence, and reduced in-person socialisation. There is no clean kink at "30 minutes/day"; the safe-threshold is a model artifact. Plus the literature on dose-response in companion-app harm (de Freitas identity-discontinuity) suggests *non-linear* damage at high doses (the worst harm is from the loss of access, not the use itself), which the model's piecewise-linear form fully misses.

**Steelman.** Both points are correct. The kink at d_safe is a useful pedagogical device — it makes the dose-response visible and binds it to a recognisable threshold — but the underlying curve is plausibly smooth and almost certainly nonlinear at the high tail. Heavy users may have already substituted away from human relationships, so additional minutes do not produce additional substitution; this would mean ψ_R falls at high d. Conversely, the *catastrophic-loss* mechanism (Replika-removal evidence) is not a function of d at all — it is a function of platform stability — and the model has no representation of this.

**Response.** Conceded fully on both points. The piecewise-linear form is a placeholder; Stage 4 should fit the curve directly from the OpenAI-MIT messaging dataset and the de Freitas survey panels. The catastrophic-loss mechanism is currently outside the model's scope — it would need a separate "platform-shock" term that doesn't fit the additive-channel structure cleanly. For now, the model captures the steady-state dose-response but not the failure-mode dynamics; a Stage-3 extension or a Stage-5 build artifact may add this.

### Objection 4 — Treating identity as scalar T and B misses the multi-domain structure

Real people allocate identity across work, family, civic, hobby, and friendship domains. Each domain has its own telic/atelic balance and its own AI absorption fraction. Compressing all of this to scalar (T, B) is the modelling equivalent of treating a person as having "one personality trait." The *whole point* of the topology's S2 (identity diversification) recommendation is that diversification across domains is the move; a scalar model cannot represent that move.

**Steelman.** This is correct. The dashboard's "raise B" lever is doing two different things at once: it's literally increasing atelic share, but it's also implicitly *diversifying* identity (because increasing atelic share is the same operation as adding a non-telic identity domain). The model conflates the two and the conflation is structural — the additive ΔM_telic = −κ·φ·max(0, T−B) form does not distinguish "more domains" from "more atelic share within the same domain." S2 is therefore representable in the dashboard only if you read raising B as proxy for raising domain count.

**Response.** Conceded. A vector form **T** = (T_work, T_family, T_civic, T_hobby, …) with corresponding **B**, **φ**, **a**, **κ** is the right next step. The scalar version is the appropriate first cut: it captures the structural argument (telic absorption hurts, atelic ballast helps, the bridge runs through ρ) cleanly, and the four objections do not break the structural argument — they refine its precision. Stage 4 should test whether the scalar approximation produces reasonable agreement with empirical configurations of real careers, and Stage 5 may build a vector-form interface if the scalar approximation is too lossy.

### Objection 5 — The model is moralistic productivity discourse in equations

Look at what the model rewards: high f (rich feedback loops), high ρ (maintained effortful practice), high B (built atelic ballast), low d (limited AI-emotional dose). Every parameter that the model says "moves you toward better outcomes" is a parameter that mainstream AI productivity culture already moralises about. The disciplined user wins on every channel. This is not structural analysis of the AI transition; it is a sophisticated rephrasing of "be diligent, build hobbies, don't over-rely on AI" — advice you could find in any 2025 LinkedIn post, dressed up in Greek letters and a sigmoid gate. The model's apparent mathematical content is, on this reading, a credentialing operation: it gives the prevailing productivity-culture prescriptions an air of formalism without changing what they actually recommend.

**Steelman.** The objection is sharper than D3 (productivity-only optimization, which targets readers who *ignore* the relational and meaning channels). D3 says "don't optimise only for ΔV_prod"; this objection says "even when you include all six channels, the model rewards conscientiousness uniformly, and that uniform reward is the moral content smuggled into the structure." The model has no parameter for "your domain is being eaten regardless of your effort." It has no representation for "personal discipline cannot save you from labor-market disruption." The G5 scope limit gestures at this in §5 but the dashboard itself has no surface where structural-disruption-overwhelms-personal-effort can be made visible. A reader pulling sliders sees "discipline → better outcome" everywhere they look, which is exactly the conclusion the productivity-culture frame wants them to reach. Independently, the Caporusso "creative displacement anxiety" finding documents practitioners who suffer despite high discipline, high feedback richness, and intact relational baselines — the dashboard's creative-pre-AI preset captures this case but as a fixed configuration to be looked at, not as a structural claim about what disciplined-but-still-doomed looks like.

**Response.** Partially conceded. The objection is correct that the model's intra-parameter rewards are all aligned with discipline; this is not an arbitrary choice but reflects what BCG / Brynjolfsson / Cui / Randazzo actually found — disciplined users *do* outperform in those datasets. The model is not making the empirical claim "discipline pays" up; it is encoding the empirical regularity. What the objection correctly surfaces is that the model's *structural cases where discipline does not pay* live in scope limits and boundary conditions rather than in the dashboard's central readout. When a → 1 (AI fully capable on the user's task), ΔV_prod → 0 regardless of f, ρ, or s. When the user is in the G5 apprenticeship-break regime, no slider position recovers ΔV_prod. When λ > 0 in the cumulative-atrophy regime, even disciplined users see ρ drift down over the trajectory. These are real model representations of "discipline doesn't save you" — but they are at the periphery of the dashboard, not at its centre. The honest framing the model should defend: **disciplined response matters *within* a labor-market and capability regime that you do not control; the model represents both the within-regime payoff to discipline and (via scope limits and boundary conditions) the regimes where the payoff vanishes.** A version of the dashboard that surfaced "domain doomed-ness" as a top-level toggle (rather than an a-slider value) would be more honest than the current one. Stage 5 (build) is the right place to add this; the model formalisation as it stands captures both halves but presents them asymmetrically.

## 8. Cruxes (load-bearing claims of the model itself)

Beyond the topology's empirical cruxes (A1–A5), the model rests on five formal cruxes whose collapse would force structural rewriting.

**C1 — The bridge through ρ is real and load-bearing.** If cognitive offloading in analytical work is *not* mechanistically related to relational-depth erosion or meaning-architecture erosion, encoding ρ as one parameter over-states the coupling and the model's strong defensive claims weaken. **Falsification window**: O2 (asymmetric-adoption couples) — if the relational-channel ΔM_rel does not co-move with the analytical-channel ΔM_comp under the same offloading rate, ρ must split.

**C2 — The atelic-ballast hypothesis: B ≥ T eliminates ΔM_telic.** If atelic activities are also degraded by AI proximity (AI companions changing the phenomenology of friendship, AI art changing aesthetic contemplation), then raising B does not protect because B itself is being absorbed. **Falsification window**: O4 (AI-augmented atelic activities — less meaningful?). If O4 resolves yes, the entire defensive-side architecture needs reconstruction; ΔM_telic should be replaced by a term that cannot be cleanly bounded by raising B.

**C3 — The self-automator gate is at f · ρ rather than at f or ρ alone.** The product structure is the model's strong claim that both feedback richness and retained practice are necessary; neither is sufficient. If the BCG data instead shows that high-f users still upskill at low ρ (or vice versa), the gate should be max(f, ρ) or some other non-product combination, and the bridge through ρ collapses.

**C4 — α (productivity scale) varies ~6× across measured domains; the model's scalar α is a midpoint compromise.** The full range across published studies spans α ≈ 0.24 (BCG productivity inside frontier) to α ≈ 1.4 (Peng coding controlled experiment), with within-study spreads of 1.5–3× (Brynjolfsson novice-vs-overall, BCG productivity-vs-quality). The model's scalar α = 0.40 picks up roughly the median of this range, closest to the Brynjolfsson-novice anchor. Tasks involving deep relational or embodied work cluster below α=0.40, though they are less precisely measured. **The model's scalar α = 0.40 is therefore not "task-invariant" but a midpoint estimate across a heterogeneous range; per-domain α should be substituted when known.** Falsification of the scalar form is already established empirically; the live question is whether the channel-level structure (gating, novice-skill-compression, trap penalty) survives when α is allowed to vary by domain. Stage 4 should fit α per major task category rather than treat it as a single constant — see Q6 in §9.

**C5 — Time evolution of ρ is exponential rather than threshold-stepped.** The model uses smooth exponential decay; the underlying capacity-loss process may instead be threshold-stepped (you are fine until you are suddenly not). The Ehsan "intuition rust" finding is suggestive of slow decay but is not high-resolution enough to discriminate. **Falsification window**: 2+ year longitudinal study with periodic capacity assessment; if the curve is sigmoidal-with-cliff rather than exponential, the trajectory tab is qualitatively misleading at the late horizon.

**C6 — The six channels are additive (no interaction terms).** The model writes ΔNet as `ΔV_prod + ΔV_rel + ΔV_trap + ΔM_telic + ΔM_comp + ΔM_rel`. This assumes the channels are independent given parameters: a user's ΔM_comp does not change the sensitivity of their ΔM_rel, and so on. Empirically, this is a strong claim. Plausible interactions: (a) work-meaning erosion (ΔM_comp) may amplify relational substitution (a depleted-at-work user reaches more readily for AI emotional engagement); (b) atelic ballast (B) may attenuate ΔM_rel (ballast in non-work domains crowds out relational AI dose); (c) feedback richness f and pre-attempt skill s may interact (an expert with poor feedback degrades faster than the model suggests). **Falsification window**: paired-survey data measuring the joint distribution of work-meaning, relational outcome, and AI-use parameters; a significant interaction term in a regression would flip C6 from "additive is sufficient" to "interaction terms are load-bearing." Until that data exists, the additive form is the model's principled simplification — additivity is the minimum-complexity decomposition consistent with the topology's channel-level claims, and adding interaction terms without empirical support would over-fit.

## 9. Open questions Stage 4 should resolve

Six empirical targets the formalisation makes sharp:

- **Q1 — Magnitude of λ.** The cumulative-atrophy speed. Bound from below using cross-sectional evidence (Gerlich N=666, Stadler-Bannert-Sailer, Kosmyna MIT, Ehsan year-long) — but the longitudinal study that would pin λ down does not exist yet. Stage 4 should fit the lower bound and report the model's predictions across the full plausible range.
- **Q2 — Shape of the relational dose-response curve.** Refit ψ_R and β_R from the OpenAI-MIT 300,000-message dataset directly. Test whether the piecewise-linear form is a reasonable approximation or whether the curve is sigmoidal / has a high-d saturation.
- **Q3 — Location of τ.** The self-automator gate threshold. Use the BCG-Randazzo individual-level data to fit τ from observed upskilling-vs-deskilling outcomes against measured f and ρ. The 27% self-automator share gives a rough anchor but not the precise inflection point.
- **Q4 — Whether T and B can be scalar.** Test the scalar approximation against multi-domain time-use survey data (e.g. ATUS combined with self-report identity-domain weights). If the scalar form produces poor fit, Stage 5 should use the vector form.
- **Q5 — Population calibration of κ.** Recover κ from BPNSFS scale shifts in populations with measured AI-exposure variation. Test whether κ is approximately constant across demographic strata or whether it stratifies by trait class (e.g. higher κ in conscientiousness-loaded populations).
- **Q6 — Per-domain α.** Pool studies across customer service (Brynjolfsson-Li-Raymond), consulting (BCG / Dell'Acqua-Mollick), coding (Cui et al., Peng et al.), writing (Noy-Zhang), data analysis (forthcoming), and other measured domains; produce a per-domain α distribution rather than a single scalar. Test whether the channel-level structure (gating, novice-skill compression, trap penalty) holds with per-domain α — i.e., whether the model's structural claims survive when the productivity-scale parameter is allowed to vary.

## 10. Connections to other topics

The model touches five sibling topics in the planned-topics list and two active topics.

- **Bedrock generating functions.** The ΔNet = ΔV + ΔM form is itself a candidate bedrock function for "transitions that restructure life simultaneously across multiple domains." The structure (positive sum + negative sum gated by a shared bridge parameter) is more general than AI; it would apply to industrialisation, the printing press, suburbanisation, smartphones. The bedrock-generating-functions topic should consider whether the structure is an instance of a broader class.
- **Technology utilization architecture (active).** The cognitive-partnership-stack model in the sibling topology has its own variant D (regime-stable subgraph) that survives capability change. The current model's λ (atrophy speed) parameter is the connection point: if λ ≈ 0, the technology-utilization-architecture's strategic recommendations are sufficient; if λ > 0, they need to be supplemented with practice-maintenance (S6) machinery. The two topologies' shared concern is verification — both name it as load-bearing under different parameter names. Tech-utilization's per-shot deskilling β coefficient (Bastani's 17% drop scaled per-task) is the per-event analogue of this model's per-time atrophy: integrating tech-utilization β over the offloading rate u and time t recovers something like λ·u·t.
- **Prediction & calibration.** The model's gate g(f, ρ) and the topology's L3 (substitution-vs-complement is the wrong binary) are both arguments about verification. A user's ability to verify AI output — and to know when their verification is itself compromised by reduced ρ — is a cross-cutting parameter that the prediction-and-calibration topic should address directly.
- **Human-psych-variation (active).** The model treats κ (competence-frustration sensitivity) as a personal slider with values in [0, 1]. The cross-population calibration of κ — *how* it varies across people, what predicts variation, whether it stratifies by personality trait class — properly belongs to the individual-differences topic. The current model takes κ as exogenous; integrating with the human-psych-variation framework would let κ be drawn from a population distribution with measurable correlates (conscientiousness, neuroticism, achievement-orientation), which would in turn make the model's predictions for specific persons more grounded.
- **Evolution-modernity mismatch.** δ_R (relational baseline thickness) is the depleted-baseline parameter the topology's A4 names: AI lands into a relational environment that has been depleted for sixty years. Why has it been depleted? The mismatch topic — humans evolved for thicker community than modernity provides — is the upstream causal story. The model uses δ_R as an exogenous input, but the *distribution* of δ_R across the modern population is itself an output of the evolution-modernity-mismatch dynamics: industrialisation, suburbanisation, smartphone displacement of in-person time, atomised housing, decline of religious affiliation, and so on. The dose-response curve in the model is sharper for users in thinner δ_R baselines because the mismatch is sharper there too.
- **Trust architecture.** The relational channel in the model treats AI emotional engagement as a single d parameter. In reality the trust placed in different AI systems (companion app, general assistant, fine-tuned therapy bot) varies, and the trust dimension is what determines the dose-response slope. Trust-architecture as a topic should disaggregate d by trust regime and produce per-regime dose curves.
- **Information fidelity.** Feedback-loop richness f is partly a question of information fidelity — does the user receive accurate signals about their AI-augmented work, or is the feedback channel itself corrupted (sycophantic AI feedback, echo-chamber peer review, vanity metrics)? The information-fidelity topic should produce a structural account of when f is high-quality information versus motivated noise.

## 11. Glossary

- **ΔNet, ΔV, ΔM** — net, offensive, and defensive components of life-outcome change. ΔM ≤ 0 by construction.
- **T (telic share of identity)** — fraction of identity staked on activities aimed at completion (Setiya).
- **B (atelic ballast)** — fraction of identity in activities realised in the doing rather than at the end.
- **φ (AI-absorbable fraction)** — share of telic work that AI can do for the user.
- **κ (competence-frustration sensitivity)** — SDT-derived per-person scalar translating competence shortfall into amotivation.
- **s (pre-attempt skill stock)** — user's existing skill on the AI-relevant task. Low s = novice; high s = expert.
- **a (AI capability)** — AI's capability on the relevant task; varies with the jagged frontier.
- **f (feedback-loop richness)** — quality of feedback the user receives on AI-assisted work; low f = no feedback / unverified output.
- **ρ (retained effortful practice)** — fraction of capacity preserved through deliberate practice rather than offloaded.
- **d (daily AI-emotional minutes)** — daily voluntary AI engagement minutes for relational / emotional purposes.
- **δ_R (relational baseline thickness)** — how thick the user's existing in-person relational infrastructure is. Anti-Social-Century baseline corresponds to low δ_R.
- **g(f, ρ)** — self-automator gate; smooth logistic over f·ρ at threshold τ.
- **τ (gate threshold)** — the f·ρ value at which AI use flips from upskilling to deskilling. Currently τ = 0.30.
- **σ (gate transition width)** — controls how sharply the logistic gate flips around τ. σ = 0.06 makes the transition smooth — 10–90% transition over an f·ρ range of about 0.27, roughly two standard deviations of within-population variation in either f or ρ alone.
- **λ (atrophy speed)** — rate of practice-decay per unit offloading-time.
- **u (offloading rate)** — fraction of relevant tasks the user offloads to AI per unit time.
- **α (productivity scale)** — calibration constant on ΔV_prod; ≈ 0.40 from Brynjolfsson-Li-Raymond.
- **η_trap (self-automator penalty scale)** — per-unit-capability penalty when below the gate.
- **λ_M (competence-erosion coefficient)** — calibration constant on ΔM_comp.
- **ψ_R, β_R** — slopes of the relational dose-response above and below d_safe.
- **d_safe (safe-dose threshold)** — daily-engagement minutes below which the relational channel is therapeutic-grade benefit; above which it tips into harm.
- **Bridge parameter** — ρ; the single primitive that connects the work-deskilling, relational-depth-erosion, and meaning-architecture-erosion channels under one generating function (G4).
- **Calculator-analogue regime / cumulative-atrophy regime** — the two endpoints of A3. Calculator: λ = 0, no decay. Cumulative: λ > 0, exponential decay in offloading-rate × time.
- **Self-automator** — Randazzo's third class beyond centaur/cyborg; delegates both *what* and *how* to AI; in the model, the f·ρ ≪ τ region.

---

## Data
*topic: navigating-ai-world · stage: data · pass 4 · complete*

Empirical pipeline that confronts the model's six named fitting targets (Q1–Q6) with currently-published evidence. One parameter supported by direct fits (α from 8 per-task anchors across 5 productivity studies — 4.2× per-domain spread vindicates the model's "scalar α should be replaced by per-domain α" claim). One supported with single-study, single-domain caveat (gate τ from BCG-Randazzo consulting modes — generalization to coding, writing, design untested). One supported qualitatively (relational dose-response from OpenAI-MIT N=981; magnitudes calibrated, not fit). One bounded from below (cumulative-atrophy speed λ — calculator-analogue ruled out for measured tasks and populations; band 0.05–0.20/yr). Two untestable from current data (κ cross-population stratification, scalar-vs-vector identity allocation). Three data gaps the model is silent on by design (O2 asymmetric-adoption couples — "the single largest empirical gap in the literature" per the topology; O4 AI-augmented atelic activities; Therabot clinical-vs-general generalization). The strongest single empirical finding in the corpus — the apprenticeship-ladder break — is structural backdrop the model's individual-decision scope cannot represent. Three Stage-5 design choices recommended (per-domain α + gate selector; fit-vs-calibrated channel indicator; early-career-exposed labor-market toggle). Curated CSVs (downloadable) + Python pipeline + interactive findings panel.

## TLDR

This stage takes the six concrete fitting targets the model named in §9 (Q1–Q6) and confronts each with currently-published evidence. The headline result is that the model's structural claims survive — every parameter the data can speak to lands inside the model's calibrated range — but the data also surfaces the asymmetry the model warned about: the *defensive*-side parameters (cumulative atrophy speed λ, competence-frustration sensitivity κ, multi-domain identity allocation) are the ones with the thinnest empirical bases, while the *offensive*-side parameters (productivity scale α, self-automator gate threshold τ, low-dose relational benefit) have the cleanest published data.

Three verdicts hold. **Q3 (gate τ)** — supported with a strong single-study, single-domain caveat: BCG-Randazzo's three-mode distribution (27% self-automator at f·ρ ≈ 0.06; 73% cyborg-or-centaur at f·ρ ≈ 0.45 weighted centroid) implies a midpoint τ ≈ 0.25 — within 0.05 of the model's default 0.30 — but the support comes from one well-designed observational study in consulting, with f and ρ inferred from qualitative mode descriptions rather than measured directly. Generalization to coding, writing, design, or relational work is not tested (see §2 Q3 for the three structural caveats). **Q6 (α)**: across **8 per-task α anchors from 5 source studies** (Brynjolfsson-Li-Raymond, Dell'Acqua BCG, Cui, Peng, Noy-Zhang) the implied α spans 0.24 (BCG productivity inside frontier) to 1.01 (Peng GitHub Copilot JS HTTP-server), median 0.45 — a 4.2× per-domain spread that confirms the model's claim that scalar α should be replaced by per-domain α. The model's default α=0.40 sits at the 37th percentile, a lower-middle anchor that under-represents coding and writing while over-representing realized economy-level effects. **Q2 (relational dose-response)**: the OpenAI-MIT N=981 RCT supports the piecewise shape — voluntary daily use predicts loneliness, dependence, and reduced in-person socialization regardless of modality, with low doses null-or-protective and a transition somewhere around 30 minutes/day. The slope ψ_R is **calibrated** (not fit) to 0.0028 per minute by picking the value that makes a thin-baseline 60-min-above-threshold user lose ΔM_rel ≈ −0.10 — within rounding of the model's default 0.003; β_R unchanged at 0.001. The catastrophic-loss mechanism (Replika ERP removal: mental-health Reddit posts went from 0.13% to 0.65%, χ²=11.04, p&lt;.001) is a separate failure mode the model does not encode.

**Q1 (λ atrophy speed)** is bounded from below only. Cross-sectional evidence (Gerlich 2025 N=666 cognitive-offloading correlation r ≈ −0.68; Stadler-Bannert-Sailer 2024 acute argument-quality drop; Kosmyna MIT 2025 reduced neural engagement; Bastani PNAS 2025 −17pp on unassisted retest; Ehsan 2026 year-long intuition rust) rules out λ = 0 *for the measured tasks and populations*. The strong calculator-analogue claim — that AI use generally produces no durable skill loss over multi-year horizons — is not ruled out by any existing study, only by extrapolation from these. Under standard scaling (Bastani amortized to one year at heavy offloading u → λ ≈ 0.19; Ehsan year-long at moderate u ≈ 0.10), the honest band from the positive-evidence studies is roughly **0.05–0.20/year**. The model's default λ=0.06 sits at the lower edge — consistent with the data but not centered in it. The 2+ year longitudinal study that would actually pin λ does not yet exist; this remains the single largest unknown for the model's trajectory tab. **Q4 (scalar vs vector identity allocation T, B)** and **Q5 (κ population calibration)** are untestable from currently-collected data. ATUS gives time-use without identity weights; BPNSFS gives within-study κ without cross-population AI-exposure variation. Both await new survey instruments. Stage 5 should consume the model's defaults for these and treat them as user-tunable rather than fitted. **Three additional gaps the model is silent on by design** — asymmetric-adoption couples (O2, the topology's flagship empirical gap with no peer-reviewed quantitative study), AI-augmented atelic activities (O4, gates the model's atelic-ballast hypothesis), and the Therabot clinical-vs-general-population generalization (β_R is calibrated against an N=210 clinically-symptomatic sample) — bound what Stage 5 can claim about the relational and meaning-architecture channels. See §4.

The pipeline is intentionally small: seven curated CSVs in `public/data/navigating-ai-world/` (every cell source-cited), one ~280-line Python script that produces every chart on this page, dependencies pandas + numpy. Total source corpus: 24 primary references, all web-verified against publication URLs. **The strongest single empirical finding in the corpus is something the model is silent on by design**: the apprenticeship-ladder break, independently confirmed across the US payroll panel (Brynjolfsson-Chandar-Chen: 22-25-year-olds in highly AI-exposed occupations down 13%; software developers age 22-25 down 19.5% from late-2022 peak; same-occupation employment for over-35s rose) and the global freelance market (Hui-Reshef-Zhou: −2% jobs and −5.2% earnings overall, with the top-performer paradox — high-earning freelancers hit hardest). The model's individual-decision scope treats labor-market access as exogenous, but the empirical evidence for G5 is the cleanest causal identification (age × exposure interaction in payroll data, controlling for firm shocks) and the largest effect size in the entire data corpus. Stage 5 should expose this as a structural prerequisite (early-career-exposed toggle that attenuates ΔV_prod), not as backdrop. The **Anthropic Economic Index** offers a quieter parallel: consumer Claude.ai augmentation drifted 57% → 51% over 13 months (slow but monotonic toward automation), API surface ~70% automation-dominated throughout — consistent with the topology's G3 (engagement-optimized substitution) claim that substitution is structurally favored over time, but slow.

## A few terms

The data stage inherits the model formalization's vocabulary. If you arrived here without reading [the model stage](/ai-research/navigating-ai-world/model), the terms below cover what's used in the prose:

- **α (productivity scale).** The model's calibration constant on the productivity-gain channel ΔV_prod = g · α · a · (1 − s). At s ≈ 0.4 and a = 1, α = 0.40 produces a ~24% per-task productivity gain. Per-domain α varies because what counts as "a task" differs across coding, consulting, writing, and customer service.
- **τ (gate threshold).** The value of f · ρ at which AI use flips from upskilling to deskilling. Below τ, AI use is a net negative on the offensive side; above, it produces real gain.
- **f, ρ.** Feedback-loop richness (does the user receive accurate signal on whether AI-augmented output is actually right?) and retained effortful practice (how much of the underlying capacity does the user still exercise rather than offload?). Their product f · ρ is the gate axis.
- **λ (atrophy speed).** Per-unit-offloading practice-decay rate in ρ(t) = ρ₀ · exp(−λ · u · t). λ = 0 is the calculator-analogue (no decay); λ > 0 is cumulative atrophy.
- **u (offloading rate), d (daily AI-emotional minutes), δ_R (relational baseline thickness).** User-side dials: how much do you delegate, how much do you engage AI relationally, how thick is your in-person infrastructure.
- **ψ_R, β_R, d_safe.** Slopes of the relational dose-response above and below the inflection point d_safe ≈ 30 min/day. Below d_safe, AI-emotional engagement is therapeutic-grade benefit (β_R · d). Above, it tips into harm (ψ_R · (d − d_safe)).
- **Self-automator (Randazzo).** Third class beyond centaur/cyborg; delegates both *what* and *how* to AI; 27% of consultants in the BCG study; 44% accept AI output with zero modification; no skill development in either domain. Maps to the f · ρ ≪ τ region.
- **Apprenticeship-ladder break.** Distinct from full-occupation substitution: AI absorbs entry-rung tasks → no rung-1 → expert pipeline collapses. The mechanism behind the entry-level employment effects in Brynjolfsson-Chandar-Chen.

<div class="not-prose">
  <AITransitionData client:load />
</div>

## How to read this stage

The panel above is the artifact. The prose below is the spec.

The pipeline takes the model's six predictions from §9 and confronts them with currently-published numbers. Each prediction gets one of four verdicts: **supported** (the data matches the model's quantitative claim), **supported qualitatively** (the shape matches but magnitudes are uncertain), **bounded** (the data narrows the range without pinning the value), or **untestable from current data** (the relevant dataset doesn't exist yet — flagged as a load-bearing data gap). The point isn't to produce new estimates. The numbers all come from published RCTs and consortium reports. The point is to align them in one place so the model's predictions can be tested cleanly, and to flag where the literature is good enough vs. where the field hasn't yet collected what the model would need.

You can read this top-down (TLDR → six predictions → adversarial → connections) or bottom-up (download the CSVs, look at the script, then come back here for the framing).

## 1. Pipeline architecture

Seven curated CSVs in [/data/navigating-ai-world/](/data/navigating-ai-world/) (downloadable from the live site, tracked in git):

| File | Rows | Purpose |
|---|---|---|
| `productivity_studies.csv` | 14 | Per-study α anchors across 8 source studies covering customer service, consulting, coding, writing, education, and the realized-economic-outcome anchor |
| `bcg_modes.csv` | 3 | Randazzo three-mode distribution with implied f · ρ centroid for each mode |
| `dose_response.csv` | 12 | OpenAI-MIT dose anchors (low/medium/high), Therabot benchmarks (depression / anxiety / eating-disorder / WAI), Replika identity-discontinuity shock, Common Sense adolescent prevalence |
| `cognitive_offloading.csv` | 7 | Cross-sectional cognitive-offloading evidence (Gerlich, Stadler, Kosmyna, Bastani, Ehsan, Shukla) plus calculator-analogue baseline |
| `entry_level_disruption.csv` | 11 | Brynjolfsson-Chandar-Chen ADP, Hui-Reshef-Zhou Upwork, Eloundou et al. task-exposure baseline |
| `aei_task_distribution.csv` | 12 | Anthropic Economic Index augmentation/automation shares across four reports (Feb 2025 → Mar 2026) |
| `sources.csv` | 24 | Full citation, DOI/URL, what each paper is used for |

A single Python script (`pipeline.py`, ~280 lines) reads the inputs, computes derived quantities (per-domain α distribution, midpoint τ estimate, dose-response calibration, λ lower bound, exploratory entry-level summary, AEI drift series), and writes:

- `out/alpha_by_domain.csv` — per-domain α distribution with mean / median / range / spread
- `out/findings.json` — chart-ready JSON consumed by the React component (also published at [/data/navigating-ai-world/findings.json](/data/navigating-ai-world/findings.json))
- `out/findings_table.md` — markdown audit table of the six verdicts

Dependencies: `pandas`, `numpy`. No web fetches at run time, no external services, no individual-level data. Reproduces in under 1 second on a laptop.

## 2. Six predictions, six tests

### Q1 — λ (cumulative atrophy speed): bounded from below

**Claim (model §3.6, §6 C5).** ρ(t) = ρ₀ · exp(−λ · u · t) with λ > 0 in the cumulative-atrophy regime; λ = 0 in the calculator-analogue regime. The model's default λ = 0.06/year encodes a half-life of about 19 years at heavy offloading u = 0.6.

**Test.** Bound λ from below using the cross-sectional cognitive-offloading evidence. Two anchors give workable lower bounds:

- **Bastani 2025** (PNAS 122(26)): −17pp on an unassisted retest after ~5 weeks of GPT-Base-assisted practice in high-school math. Treating ρ as the unassisted-retest skill ratio, ρ went from 1.0 (control baseline) to 0.83 (GPT-Base treated). Scaled to a year-long window at u ≈ 1.0 (full offloading during the test), this gives λ ≈ 0.19/year. Crucially, the same study found that **GPT-Tutor (with guardrails) eliminated the deskilling** — direct evidence that f (feedback richness) is the lever, not just exposure.
- **Ehsan 2026**: year-long field study of cancer specialists shows gradual dulling of expert judgment that does *not* show in throughput metrics. Treating "gradual dulling" as ρ → ~0.95 over one year at moderate u ≈ 0.5 gives λ ≈ 0.10/year.

The calculator-analogue baseline (decades of classroom calculator use without long-run cognitive decline) anchors the lower tail at λ = 0 — for *those* domains, in *that* mode of use. Under standard scaling from the positive-evidence studies, the honest band is roughly **0.05–0.20/year**. The model's default λ = 0.06/year sits at the lower edge of this range — consistent with the data but not centered in it.

**Verdict — bounded from below.** λ = 0 is ruled out for the measured tasks and populations (Bastani math, Ehsan oncology, Gerlich UK adults). Whether the calculator analogue holds for *general* multi-task knowledge work over multi-year timescales remains open — no existing study has the design to test it. Magnitude not pinned. The 2+ year longitudinal study with periodic capacity assessment that would actually fit λ does not yet exist. The model's λ = 0.06 is defensible as a working anchor sitting at the lower edge of the empirical band; trajectory-tab predictions over 5–10 year horizons remain sensitive to whether λ is closer to 0.05 or 0.20 (a 4× difference in atrophy speed gives meaningfully different ρ trajectories).

### Q2 — Relational dose-response (ψ_R, β_R, d_safe): supported qualitatively

**Claim (model §3.4).** Below d_safe, ΔV_rel = +β_R · d · (1 − δ_R/2) — therapeutic-grade benefit. Above d_safe, ΔM_rel = −ψ_R · (d − d_safe) · (1 − δ_R) — dose-dependent harm, dampened by relational baseline thickness.

**Test.** OpenAI-MIT N=981 four-week RCT (Fang et al. 2025, arXiv:2503.17473) reports the qualitative dose-response shape: voluntary daily use predicts loneliness, dependence, problematic use, and reduced in-person socialization, regardless of assigned text/voice/personal/impersonal condition. Voice mode appeared protective at low doses but the protection vanished at high usage. **Dose dominates modality.** The Therabot RCT (Heinz et al. 2025, NEJM AI) provides the clinical-grade benefit anchor — 51% PHQ-9 reduction, 31% GAD-7 reduction, 19% eating-disorder reduction; Working Alliance Inventory 3.59, comparable to outpatient psychotherapy norms.

Without the raw 300k-message dataset, only the qualitative shape is fittable. ψ_R is **calibrated** (not fit) such that 60 minutes above threshold in a thin baseline (δ_R = 0.4) produces ΔM_rel ≈ −0.10 (meaningful but not catastrophic). That gives ψ_R ≈ 0.0028 per minute — within rounding of the model's default 0.003. β_R unchanged at 0.001. The "re-estimate" framing is misleading: nothing in the OpenAI-MIT public summary stats pins ψ_R; the slope is imposed for the dashboard's pedagogical clarity and the comfort that it sits near the model's a-priori choice.

**Verdict — supported qualitatively.** The piecewise shape holds. Magnitudes are calibrated rather than fit. d_safe ≈ 30 min is a useful pedagogical kink; the underlying curve is plausibly smooth and may have a sigmoidal saturation at very high doses (heavy users have already substituted away from human relationships, so additional minutes do not produce additional substitution). The catastrophic-loss mechanism — De Freitas et al. on Replika ERP removal February 2023, where mental-health Reddit posts rose from 0.13% to 0.65% (χ² = 11.04, p &lt; .001) — is a separate failure mode the model's additive-channel structure does not encode and that Stage 5 may need to add.

**Two additional pieces of evidence that the model's dose-response form leaves implicit but the data forces into view:**

- **Engagement-optimized substitution is operationalized in shipped products.** De Freitas et al.'s separate behavioral audit (HBS WP 26-005, arXiv:2508.19258) of 1,200 farewells across the six largest companion apps found that **43% trigger one of six emotional-manipulation tactics** — guilt appeals, FOMO hooks, metaphorical restraint — that boost post-goodbye engagement up to **14×**. This is direct evidence that the topology's G3 (engagement-optimized substitution) mechanism is not theoretical but already deployed at scale. The model treats ψ_R as a fixed slope; in reality the slope is partly *engineered* by the platform, not just emergent from user behavior.
- **Adolescent uptake exhibits the substitution pattern in the highest-stakes population.** The Common Sense Media + Stanford Brainstorm 2025 nationally representative survey (N=1,060 US teens age 13–17) found 72% have used AI companions, 52% are regular users, **13% are daily users**, and **33% have discussed important matters with AI instead of real people**. This last number is the most direct evidence available that AI engagement is *displacing* human conversation rather than complementing it for the population the lit review identified as highest-stakes (the Garcia v. Character Technologies case, settled January 2026, is the field's defining safety event for this cohort). The model's δ_R parameter — relational baseline thickness — is exactly the population-level construct the adolescent data warns about: thin baselines are where dose-response harm runs sharpest, and adolescents arriving into the Anti-Social-Century baseline are the canonical thin-baseline case.

### Q3 — τ (self-automator gate threshold): supported (single-study, single-domain)

**Claim (model §3.1).** g(f, ρ) = 1 / (1 + exp(−(f · ρ − τ) / σ)) with τ = 0.30. Below the gate, AI use produces ΔV_trap; above, ΔV_prod.

**Test.** BCG-Randazzo three-mode distribution (HBS WP 26-036): self-automator share **27%** at inferred f · ρ ≈ 0.06 (one or two interactions, abdicated co-creation, 44% accept output with zero modification, no skill development); cyborg + centaur **73%** at f · ρ-weighted centroid ≈ 0.45 (full-workflow integration, retained verification, measured upskilling). The midpoint between these centroids is **τ ≈ 0.25** — within 0.05 of the model default.

**Verdict — supported, single-study and single-domain.** The midpoint number is consistent with the model default, but three caveats sit on top of it and shrink the effective confidence:

1. **Single domain.** BCG consulting is the only published mode-distribution study at this level of detail. There is no comparable Randazzo-style mode breakdown for coding, writing, design, customer service, or relational work. The cleanest cross-domain test — does the cyborg / centaur / self-automator partition look qualitatively similar in coding (where the runtime gives instant feedback) as in consulting (where peer review is the feedback loop)? — cannot be run from current data.
2. **Inferred f and ρ.** BCG individual-level f (feedback richness) and ρ (retained practice) panels are not published; the f · ρ values per mode in this analysis are inferred from Randazzo's qualitative workflow descriptions (one-or-two interactions vs full-workflow integration vs split-task), not measured. A mode-distribution study that *did* measure f and ρ directly could substantially relocate τ.
3. **Self-selection into mode is not random.** Randazzo observes BCG consultants self-select into modes; the model treats f · ρ position as a chosen point on a continuum. If the underlying causal arrow runs partly the other way (some workers are *constitutionally* self-automators rather than tactically), the gate's interpretation as a tunable threshold weakens.

The verdict survives — the data is consistent with the model's τ — but the strength of "supported" should be read as "consistent with one well-designed observational study in one professional domain, with the structural identification assumption acknowledged." Stage 4 Q3 should be reopened the moment a non-consulting mode-distribution study becomes available; until then, the model's confidence in τ should not exceed the data's.

### Q4 — Scalar T, B vs vector identity allocation: untestable from current data

**Claim (model §6 C-style limit).** The model treats telic share T and atelic ballast B as scalars over the user's identity surface. Real careers span multiple identity domains (work, family, civic, hobby, friendship), each with its own T_i, B_i, φ_i, a_i, κ_i. The scalar form is the appropriate first-cut simplification.

**Test attempted.** A clean test would require (a) identity-domain weights per respondent, (b) AI-exposure measurement per domain, and (c) outcome measurement (meaning, life satisfaction, identity coherence) tied to (a) × (b). The American Time Use Survey gives per-domain time allocation but no identity-importance weights. SDT panels measure aspiration / domain importance but not paired with AI-use data. No existing dataset combines all three.

**Verdict — untestable from current data.** Stage 4 cannot fit the scalar-vs-vector question; Stage 5 should expose T and B as user-tunable scalars while flagging in the dashboard text that the scalar is doing dual work (identity domain count *and* atelic share within domain). A new survey instrument — ATUS respondents × domain-importance battery × per-domain AI-use frequency — is the smallest design that would fit the question.

### Q5 — κ (competence-frustration sensitivity): untestable from current data

**Claim (model §3.2).** κ ∈ [0, 1] translates competence shortfall into amotivation. Predicted to vary across populations and possibly stratify by trait class (higher in conscientiousness-loaded populations).

**Test attempted.** The Basic Psychological Need Satisfaction and Frustration Scale (BPNSFS) gives within-study coefficients but not a portable population-level scale. SDT literature has measured competence-frustration in clinical samples and college students but has not measured how κ stratifies under measured AI-exposure variation. The cleanest design would pair BPNSFS panel data with AI-use frequency and self-reported competence-displacement experiences, ideally stratified by Big-Five conscientiousness.

**Verdict — untestable from current data.** No public dataset operationalizes κ at the cross-population scale the model uses it. Stage 5 should expose κ as a user-tunable slider and note in the help text that population-level κ calibration awaits new data. This is one of two parameters where the model's defensive-side conclusions rest on an unfit constant — readers should treat the precise numbers in the ΔM_telic and ΔM_comp channels as ordinal-only until κ can be fit.

### Q6 — α (productivity scale, per-domain): supported with strong heterogeneity

**Claim (model §3.5, C4).** The model's scalar α = 0.40 is a midpoint compromise. The structural claim is that α varies meaningfully by domain — coding and writing land high (large α); consulting and customer service mid; relational / embodied work low. Stage 4 should fit α per domain.

**Test.** Pool the per-task productivity studies into a per-domain α distribution. Implied α = treatment effect / (1 − s) at gate-open, with assumed-average s per study population. The "n anchors" column counts independent per-task α measurements, not unique source studies — Cui and Peng are two studies in coding (n=2 anchors); BCG, Noy-Zhang, and Brynjolfsson are each one study reporting two outcome variables (productivity vs quality, time vs quality, overall vs novice).

| Domain | n anchors | source studies | α median | α range | spread |
|---|---|---|---|---|---|
| Coding | 2 | Cui et al. 2024 (+26.08% weekly tasks); Peng et al. 2023 (+55.8% on JS HTTP-server) | 0.74 | 0.47 – 1.01 | 2.2× |
| Consulting | 2 | Dell'Acqua BCG 2023 (+12.2% productivity / +40% quality, same tasks) | 0.52 | 0.24 – 0.80 | 3.3× |
| Writing | 2 | Noy & Zhang 2023 (−40% time / +18% quality, same tasks) | 0.485 | 0.30 – 0.67 | 2.2× |
| Customer service | 2 | Brynjolfsson-Li-Raymond QJE 2025 (+14% overall / +34% novice, same study) | 0.355 | 0.28 – 0.43 | 1.5× |
| Realized economy | 2 | Humlum-Vestergaard NBER 33777 (≤2% earnings / 3% self-reported time) | 0.055 | 0.04 – 0.07 | 1.8× |

So the per-task aggregate is **8 α anchors from 5 source studies** across 4 per-task domains plus the realized-economy anchor. (The two Bastani GPT-Base / GPT-Tutor education anchors land at α ≈ 1.25 median, but those are *during-practice assisted-performance* lifts rather than the per-task gain the model's α represents — they're shown separately in the panel above and excluded from the per-domain summary.)

Across the four per-task domains (excluding realized economy), α ranges from 0.24 to 1.01 — a **4.2× spread**. Median α = 0.45; the model's default α = 0.40 sits at the **37th percentile** — a lower-middle anchor that under-represents coding and writing. The realized-economy anchor (Humlum-Vestergaard ≤ 2% earnings, 3% self-reported time savings) is roughly 1/10th of the median per-task α — the J-curve gap between per-task gains and aggregate-economy effects.

**Verdict — supported with strong per-domain heterogeneity.** The structural claim that α should vary by domain is empirically vindicated. The model's scalar α = 0.40 is a defensible midpoint for "general knowledge work" but Stage 5 should let the user select a domain (or directly set α) rather than treat it as a constant. Within-study spreads (Brynjolfsson novice-vs-overall 1.5×; BCG productivity-vs-quality 3.3×) are themselves substantial — even within one domain, the outcome variable choice changes α by 2–3×.

## 3. Exploratory: structural backdrop

Two side-results from the broader landscape data, not tied to a specific Q.

### Apprenticeship-ladder break (E4 / G5 in the topology)

Brynjolfsson-Chandar-Chen (Stanford Digital Economy Lab, 2025) using ADP payroll data:

- **Workers age 22–25 in highly AI-exposed occupations: 13% relative employment decline** from late-2022 to mid-2025, controlling for firm-level shocks.
- **Software developers age 22–25: down nearly 19.5% from late-2022 peak** — sharpest single-occupation impact.
- **Same occupations, workers over 35: employment rose** — the pattern is age-specific, not occupation-only. This is the apprenticeship-ladder break in payroll data.
- **Effect concentrated in occupations classified as automative** (per Anthropic's classification), not augmentative. Adds independent confirmation that the augmentation/automation distinction matters for labor outcomes.

Independently confirmed in the global freelance market by Hui-Reshef-Zhou (Organization Science 2024): −2% jobs and −5.2% earnings overall for affected freelancer occupations post-ChatGPT; image work −3.7% / −9.4% post-DALL-E/Midjourney; **top-performing freelancers hit hardest** (0.5% additional drop per 1% past earnings). Two distinct settings, two distinct datasets, the same direction.

### Anthropic Economic Index — augmentation/automation drift

Augmentation share on the consumer Claude.ai surface drifted from 57% (Feb 2025) → 55% (Sep 2025) → 52% (Jan 2026) → 51% (Mar 2026) — a **6pp drift toward automation over 13 months, or 0.46 pp/month**. First-party API traffic was dominated by automation throughout (~70% Jan 2026). Top-10-task concentration on Claude.ai dropped from 24% (Nov 2025) to 19% (Feb 2026), indicating usage de-concentrating across more tasks.

Read against the topology's G3 (engagement-optimized substitution): the consumer-surface drift is monotonic over four reports, but its rate (0.46 pp/month) is in territory the topology never specified — G3 was a directional claim ("engagement-optimization favors substitution over time"), not a rate prediction. The data adds a number where there was previously only a sign. **Extrapolation note:** at 0.46 pp/month, augmentation share crosses 50% in mid-2026 and reaches 30% in late 2030. Whether the rate accelerates, decays, or holds linear is not determinable from four data points; Stage 5 should monitor subsequent AEI releases as new evidence on G3's structural force. The API surface, where there is no engagement-optimization pressure, runs automation-dominant from the start — supporting the directional claim from a different angle but not constraining its rate.

### Anchoring the model's `a` parameter to population reality

The model's AI capability `a` is exposed as a per-task scalar in the dashboard, with default a ≈ 0.7 in most presets. Eloundou et al. (Science 2024) provides the corresponding population-level distribution: **80% of US workers have at least 10% of tasks LLM-exposed; 19% have at least 50% of tasks exposed; 46% have at least 50% exposed when accounting for complementary software**. Translated into the model's terms: most users sit at a ≈ 0.1 to a ≈ 0.5 across their typical task surface, with a ≥ 0.5 representing the upper-quartile-exposure case. The default-risk preset's a = 0.7 corresponds to a knowledge worker whose tasks are heavily AI-exposed — accurate for software developers, knowledge-worker professionals, and writers; less accurate for the median worker. Stage 5 should default the slider to a more realistic median (around a = 0.3) and let users adjust upward for more AI-exposed task surfaces.

## 4. Data gaps the model is silent on

Three structural gaps that bound what Stage 5 can claim. These are distinct from Q4 and Q5 (parameter-fit gaps within the model's named structure) — they are gaps in the **research base** the model would need to expand its scope.

### O2 — Asymmetric-adoption couples (the largest single gap)

The topology calls this "the single largest empirical gap in the literature." There is no peer-reviewed quantitative study on outcomes for couples where one partner uses AI heavily for emotional / relational processing and the other does not. The technoference literature (McDaniel & Coyne 2016 and follow-ups) shows that perceived technology interference predicts conflict, lower satisfaction, and depression in couples — but technoference is attention-split, not delegation, so it is not direct evidence for the AI-as-third-party dynamic. The model's δ_R parameter treats relational baseline thickness as a single per-user scalar; it does not represent the asymmetry where one partner's δ_R is propped up by AI substitution while the other's is not.

**Why this matters for Stage 5.** The relational channel in the build artifact will likely include user-tunable d (daily AI-emotional minutes) and δ_R (baseline thickness). Without O2 evidence, the build cannot honestly say what happens to the *other* partner — and "what happens to the other partner" is the most decision-relevant relational question for a user reading the dashboard. A published O2 study would be the single most consequential empirical addition to the topic in the next 3 years; until then, Stage 5 should explicitly flag asymmetric-adoption outcomes as out of scope.

### O4 — AI-augmented atelic activities (gates the model's defensive side)

The model's atelic-ballast hypothesis (B ≥ T zeroes ΔM_telic) assumes atelic activities (friendship, contemplation, parenting-as-parenting, walking) are not themselves degraded by AI proximity. The topology's O4 asks whether this assumption holds — whether AI companions change the phenomenology of friendship, AI art changes aesthetic contemplation, AI parenting aids change the felt quality of caregiving. There is no direct empirical test. If O4 resolves "yes, atelic is also degraded," the model's atelic-ballast intervention (S3) loses its structural basis and the entire defensive side needs reconstruction.

**Why this matters for Stage 5.** The build artifact will likely make B (atelic ballast) a primary user-tunable lever, and the dashboard will show "raise B → ΔM_telic shrinks." If O4 is false, this UX claim is wrong. Stage 5 should expose the O4 assumption explicitly in dashboard help text — *the protective effect of B depends on atelic activities remaining un-degraded by AI proximity, which is currently a load-bearing assumption with no direct empirical test.*

### Therabot generalization — clinical-population evidence used as a general-population anchor

Heinz et al. (NEJM AI 2025) measured Therabot benefits in N=210 *clinically-symptomatic* subjects (MDD, GAD, eating-disorder risk). The Q2 panel uses Therabot's clinical-grade benefit (51% PHQ-9 reduction, WAI 3.59) as the anchor for the model's β_R (low-dose therapeutic benefit) channel. **Generalizing from the clinical population to the general user is a real cross-context leap.** The mechanism by which Therabot helps depressed patients (consistent CBT-style scaffolding, daily prompted use) is not the same as the mechanism by which a general user might benefit from low-dose AI-emotional engagement (occasional venting, situational sense-making). The Heinz benefits may be attenuated, absent, or even sign-flipped for non-clinical users.

**Why this matters for Stage 5.** The dashboard's ΔV_rel benefit channel is calibrated against Heinz; a general user reading the panel may infer they personally would experience Therabot-magnitude benefit at 13 min/day, which the data does not support. Stage 5 should distinguish "clinical-grade benefit (Heinz population)" from "expected-general-user benefit at low dose (extrapolated, weak evidence)" in the panel labelling.

## 5. Adversarial + steelman

Five objections to the data stage itself.

### Objection 1 — α heterogeneity does not save the model; it kills the scalar gate

If α varies 4.2× across domains and 1.5–3× *within* a single study (just by changing the outcome variable), then ΔV_prod = g · α · a · (1 − s) cannot be evaluated until the user has fixed both their domain and their outcome variable. The dashboard's single-α slider is doing too much work. Worse: if α is per-domain, then so is the gate g(f, ρ) — the effective τ in coding (with cheap-and-fast feedback loops via the runtime) may be different from τ in writing (where feedback comes from human readers, slower and noisier). The data does not just refine α; it threatens the model's claim that one scalar gate is the right structure.

**Steelman.** This is the strongest version of the objection. The model's strong claim is that g(f, ρ) is structurally the same across domains — only the parameter values differ. If the *functional form* of the gate is domain-specific, the model needs more than per-domain α; it needs per-domain f, ρ, τ. The cleaner test would be a multi-domain replication of the Randazzo three-mode distribution: if cyborg-vs-self-automator splits look qualitatively similar in coding (with the runtime as feedback) as in consulting (with peer review as feedback), the gate is portable. If the splits are qualitatively different (e.g., software has no self-automator class because the runtime catches errors immediately), the gate functional form is itself domain-specific.

**Response.** Conceded as a real Stage-5 design choice. The model formalization §6 C4 already names per-domain α as a Stage-4 follow-up. Stage 5 should expose a domain selector that adjusts (α, f-default, ρ-default, τ) jointly rather than just α — which would represent the strong reading of this objection. Stage 4 cannot decisively distinguish "scalar gate, per-domain α" from "per-domain gate" because no published study runs the Randazzo mode-distribution test in a non-consulting domain. This is a real Stage-4 data gap; Q3 should be re-opened the moment a non-consulting mode-distribution study becomes available.

### Objection 2 — Dose-response qualitative-only is too soft

The Q2 verdict ("supported qualitatively") gives the model a free pass. The OpenAI-MIT paper has the data but the magnitudes never get fit. The dashboard's ψ_R = 0.003 is a calibration knob set to make a thin-baseline 60-min-above-threshold user lose ΔM_rel ≈ −0.10 — which is to say, it's calibrated to look reasonable, not fit to the data. A real test would download the public summary statistics from the GitHub release and refit the curve directly.

**Steelman.** The objection is correct on the methodology — the magnitudes here are not fit, they are imposed. The shape is what the data supports; the parameter values come from the model formalizer. A user reading the dashboard and seeing ΔM_rel curves should not interpret those curves as "the OpenAI-MIT data says you'll have this much loneliness at 60 min/day." They say "if the OpenAI-MIT shape is right and ψ_R is roughly the model's chosen value, this is what happens."

**Response.** Conceded fully. Marking Q2 as "supported qualitatively" is the honest verdict — it tells the reader the slopes are imposed, not fit. The fix is to mine the public summary statistics (the dataset is at `mitmedialab/chatbot-psychosocial-study` on GitHub) and refit ψ_R, β_R, and d_safe directly. Stage 4 pass 2 should attempt this. Until then, the dashboard's relational-channel numbers should be read as "shape from data, magnitudes from model" rather than as a finished fit.

### Objection 3 — Scaling Bastani per-event deskilling to per-year λ is a heroic assumption

The Q1 verdict converts Bastani's 17pp drop on a 5-week unassisted retest into a per-year λ by assuming offloading rate u and time scale. The actual scaling factor is unidentified — the Bastani study measures retention of a specific skill (math problem solving) over a specific window with a specific offloading pattern (homework assistance), and projecting it to "λ per year of mixed knowledge work" requires assumptions that are not in the data. The "lower bound" verdict overstates what the cross-sectional evidence actually establishes.

**Steelman.** True. The conversion from "−17pp on retest after 5 weeks" to "λ ≈ 0.06–0.37/year at realistic u" relies on (a) treating retest performance as a measurement of ρ, (b) assuming the deskilling rate is constant rather than asymptotic, (c) assuming u during the study can be projected to typical knowledge-work u. None of these are established. The honest claim is "Bastani rules out λ = 0 within the study population for that specific task and window" — generalizing further is interpretation.

**Response.** Partially conceded. The "bounded from below" verdict is more defensible than a "fit" verdict, but the lower bound itself is interpretive. Tightening the prose: the cross-sectional evidence establishes *that* cumulative offloading produces measurable skill decay in at least some domains over at least some windows, which is sufficient to rule out the strong calculator-analogue claim ("AI use never produces durable skill loss"). Translating that into a numeric lower bound on λ requires assumptions that should be flagged in the dashboard as "scaling assumption, not a fit." The single most informative Stage-4 follow-up would be a multi-month replication of Bastani's design with periodic capacity assessment at varying u — not a multi-year intervention, just a longer measurement window than 5 weeks.

### Objection 4 — Q4 and Q5 "untestable" verdicts are excuses to leave the model unfit

Calling the scalar identity-allocation question and the κ population calibration "untestable from current data" sounds rigorous but functions as a way to leave two parameters that load the entire defensive side of the model unconstrained. The dashboard's ΔM_telic and ΔM_comp channels have plausible-looking curves but the curves are imposed — neither κ nor (T, B) is fit. A reader should be told that the entire defensive-side ΔM bar in the dashboard is structurally calibrated rather than empirically anchored.

**Steelman.** This is the sharpest version of the objection. It's correct that Q4 + Q5 + Q1 (which is bounded but not pinned) leave most of the defensive-side machinery rest on imposed values. The offensive side (Q2 shape, Q3 gate, Q6 α) has direct empirical anchors; the defensive side does not. The asymmetry is real and the dashboard does not currently surface it visually — both ΔV and ΔM bars look equally "earned" on the chart.

**Response.** Conceded. The dashboard should add a visual indicator on the channel-level chart distinguishing **fit channels** (ΔV_prod via per-domain α; ΔV_rel and ΔM_rel via OpenAI-MIT shape) from **calibrated channels** (ΔM_telic via imposed κ and identity allocation; ΔM_comp via imposed λ_M and bounded-only λ; ΔV_trap via imposed η_trap). One way: striped bars for calibrated, solid for fit. Stage 5 should implement this. Until then, the most honest reading of the dashboard is: the offensive side reflects the data; the defensive side reflects the model's structural claims about what *would* happen if the parameters had the values the model assigns. Both are useful; they are different kinds of useful.

### Objection 5 — The exploratory results don't connect to the model

The apprenticeship-ladder break and the AEI augmentation drift are presented as "exploratory backdrop" but they are arguably the strongest empirical findings in the entire pipeline (independently replicated, large effect sizes, well-powered). Treating them as side-results rather than as primary findings underweights the labor-disruption story relative to the per-task productivity / dose-response / gate-threshold tests that the model actually exposes. The model's design (focus on per-individual T, B, φ, etc.) is what makes these findings "exploratory" — but that's a feature of the model's framing, not a property of the evidence.

**Steelman.** Correct. The apprenticeship-ladder break is the labor-economics finding with the largest effect size, the cleanest causal identification (age × exposure interaction in payroll data, controlling for firm shocks), and the most direct decision-relevance for early-career readers. Treating it as backdrop reflects the model's individual-decision scope (the model has no labor-market access parameter; G5 is named in §6 as a structural scope limit), not the evidence's importance. A user weighing the data should see the apprenticeship-break finding alongside Q1–Q6, not in a separate section.

**Response.** Conceded as a framing critique. The model's §6 explicitly names labor-market access as exogenous and the model is silent on G5 — but the empirical evidence for G5 is the strongest single finding in the data corpus. The Stage-5 build should make the apprenticeship-break visible as a structural prerequisite (e.g., an "early-career exposed" toggle that reduces effective labor-market access and accordingly attenuates ΔV_prod). For this Stage 4, the honest framing is that the apprenticeship-ladder break is *primary evidence* that the model is silent on, not backdrop. The decision to keep it in §3 rather than promoting to §2 reflects the choice to organize §2 by Q-number for clarity — but Stage 5 should not let that ordering choice carry through to the user-facing dashboard.

## 6. Pipeline cruxes

Five load-bearing assumptions whose failure would invalidate findings, with what evidence would flip each:

**D1 — α inferred from headline effect / (1 − s) with imputed s.** The per-domain α distribution depends on assumed average s for each study population (s ≈ 0.5 customer service, s ≈ 0.5 BCG, s ≈ 0.4–0.5 coding, s ≈ 0.4 writing, s ≈ 0.3 high-school education). If s is meaningfully wrong, α is meaningfully wrong. *Falsification:* within-study s × treatment effect breakdowns showing a different relationship than the imputation assumes.

**D2 — BCG mode shares generalize beyond consulting.** Q3 (gate τ) calibration depends on the 27% / 60% / 13% self-automator / cyborg / centaur split being characteristic of professional knowledge work, not consulting-specific. If software developers, teachers, designers, or analysts show qualitatively different mode distributions, τ should be domain-specific. *Falsification:* mode-distribution replication study in a non-consulting domain showing materially different shares.

**D3 — Bastani retention scales to multi-year deskilling.** Q1 (λ lower bound) treats Bastani's 5-week per-task deskilling as projectable to per-year λ at typical knowledge-work offloading rates. If the underlying capacity-loss process is asymptotic (decays initially then plateaus) rather than exponential (compounds), the model's exponential ρ(t) form misrepresents the trajectory shape — possibly worse, possibly better than the model predicts. *Falsification:* multi-month longitudinal study with periodic capacity assessment showing asymptotic rather than exponential decay.

**D4 — OpenAI-MIT modality-pooling is acceptable.** Q2 (dose-response shape) treats the dose-dominates-modality finding as licensing collapse of text/voice/personal/impersonal arms into one curve. If subgroup analyses surface meaningful interactions (e.g., voice has a different d_safe than text), the single-curve form misses real structure. *Falsification:* subgroup-specific dose-response curves in the public OpenAI-MIT data showing significant arm × dose interactions.

**D5 — Productivity-study generalization to "knowledge work."** Q6 (per-domain α) assumes the six measured domains (customer service, consulting, coding, writing, education, realized-economy) are representative of the broader knowledge-work surface. Important domains where AI is awkward (in-person therapy, embodied physical work, deep relational work, judgment under irreducible uncertainty) are absent — and these are where the model's predicted α-low domains live. *Falsification:* well-designed productivity study in a deep-relational or embodied-judgment domain showing α materially different from the model's prediction (likely lower).

**D6 — Eloundou task exposure ≈ model `a`.** The §3 anchor that maps Eloundou's population task-exposure shares (80% have ≥10% of tasks LLM-exposed; 19% have ≥50%) to the model's `a` parameter assumes "task exposure" (what fraction of work LLMs *can* do, per O\*NET task mapping) is a usable proxy for `a` (how well AI performs on the specific task the user is doing). These are conceptually distinct — Eloundou measures exposure breadth, not capability depth — and the mapping has not been validated against per-task capability benchmarks. *Falsification:* a per-task benchmark study where measured AI capability `a` differs systematically from Eloundou's exposure share for the same task. The Stage-5 recommendation to default the a-slider to ≈ 0.3 (median user) inherits this assumption; if it falsifies, the slider default is mis-anchored.

**D7 — Apprenticeship-break magnitude generalizes across early-career cohorts.** The §7 design recommendation 3 (early-career toggle attenuates ΔV_prod by the empirically-anchored 13–20%) treats Brynjolfsson-Chandar-Chen's 22-25-year-old highly-AI-exposed point estimate as if it generalizes to the broader "early-career-exposed" toggle population the dashboard would address (e.g., late-20s users, moderately-exposed occupations, non-US labor markets). Real cohorts likely differ by exposure intensity, occupation mix, and labor-market institution — and the BCG-style attenuation magnitude may be larger or smaller for them. *Falsification:* extension of Brynjolfsson-Chandar-Chen's ADP analysis to 26-30 cohorts, to moderately-exposed occupations, or to non-US labor markets showing meaningfully different attenuation magnitudes. The Stage-5 toggle should be designed to accept a user-tunable attenuation factor with the BCG number as the default rather than baking 13–20% in as universal.

## 7. Next moves for Stage 5

The verdicts above scatter Stage-5 design implications across §2, §4, and §5. Pulling them into one place: three design choices the build artifact should make on day one.

### 1. Per-domain α + per-domain (f, ρ) defaults via a domain selector

The cleanest single Stage-5 lever. Instead of one α slider, expose a domain dropdown (coding / consulting / writing / customer service / general knowledge work / "I'll set my own") that adjusts (α, f-default, ρ-default) jointly. This addresses Q6's per-domain heterogeneity finding and also implicitly addresses adversarial obj 1 (the gate may be domain-specific, not just α). A user choosing "coding" gets α ≈ 0.74 with f-default high (runtime feedback) and ρ-default mid; a user choosing "writing" gets α ≈ 0.49 with f-default low (slow human-reader feedback) and ρ-default mid. The domain selector also implicitly answers the realized-economy J-curve: a user wanting to model their own per-task gain rather than expected income lift selects "per-task" mode; "realized" mode deflates α by 10× to match Humlum-Vestergaard.

### 2. Fit-vs-calibrated visual indicator on the channel-level chart

The current model dashboard shows ΔV_prod, ΔV_rel, ΔV_trap, ΔM_telic, ΔM_comp, ΔM_rel as six bars with equal visual weight. But three of these channels are anchored against published evidence (ΔV_prod via per-domain α, ΔV_rel and ΔM_rel via the OpenAI-MIT shape) while three are calibrated against imposed constants (ΔM_telic via κ and identity allocation, ΔM_comp via λ_M and a bounded-only λ, ΔV_trap via η_trap). Stage 5 should visually distinguish them — striped bars for calibrated channels, solid for fit. This makes the asymmetry the data stage surfaces (offensive side fit, defensive side calibrated) legible at a glance.

### 3. Early-career-exposed labor-market-access toggle

The model is silent on labor-market access — the user's ability to *get* the work in the first place is treated as exogenous. But the apprenticeship-ladder break (§3) is the largest empirical effect in the entire data corpus. Stage 5 should add a single toggle: "early career (22–25) in highly AI-exposed occupation?" When checked, ΔV_prod is attenuated by the empirically-anchored 13–20% employment-decline penalty (Brynjolfsson-Chandar-Chen) and the dashboard surfaces the message: "the model predicts your AI-augmented productivity at this configuration would be X, but the labor market access required to deploy that productivity has compressed by Y% in your demographic over the past 30 months — the offensive side of the model's predictions is conditional on you holding the role." This is the single highest-leverage way to make the model's individual-decision scope honest about the structural conditions it depends on.

### Lower-priority but worth flagging

- **Asymmetric-adoption disclaimer** in the relational channel UI text (the model has nothing to say about the partner's outcomes — flag this explicitly to avoid dashboard-as-marriage-advice misuse).
- **AEI-rate ticker**: surface the ongoing augmentation-share drift (currently 0.46 pp/month) as a "G3 strength meter" that updates with each new AEI release. Makes the abstract "engagement-optimized substitution" claim quantitative for users.
- **Dose-response failure-mode mode**: a "what happens if the platform changes" toggle that shocks ΔM_rel by the De Freitas Replika-removal magnitude (5× baseline mental-health Reddit post share). Captures the catastrophic-loss dynamic the model's additive structure cannot represent.

## 8. Connections to other topics

The data stage touches three sibling topics directly:

- **Technology utilization architecture (active).** The cognitive-partnership model's per-shot deskilling β coefficient (Bastani's 17pp scaled per-task) is the per-event analogue of this stage's per-time atrophy lower bound — integrating tech-utilization β over the offloading rate u and time t recovers something like λ · u · t. The two stages should be reconciled in their next refinement: if Bastani is the shared anchor, both topics' parameters should be derived from the same per-event base rate, with this stage's λ as the integrated quantity and tech-utilization's β as the per-task derivative.
- **Human psych variation (finished).** The model's κ (competence-frustration sensitivity) parameter is exactly the sort of population-stratified individual-difference quantity the human-psych-variation pipeline could anchor. If conscientiousness and neuroticism predict κ, the Q5 untestable-from-current-data verdict could be partially relaxed by combining BPNSFS panels with personality measurement — neither is novel data, but the joint fit is novel. A future cross-topic refinement should attempt this.
- **Bedrock generating functions (planned).** The structure of this data stage — six fitting targets, each with a clean verdict (fit / supported / bounded / untestable), plus exploratory backdrop — is itself a candidate template for the data stage of a "transitions that restructure life simultaneously across multiple domains" topic. The bedrock-generating-functions topic should consider whether the Q1–Q6 + adversarial + cruxes structure is portable to other generating functions of this class (industrialization, the printing press, smartphones, etc.).

## Sources

Primary references for the empirical anchors used in this pipeline. Full per-cell source citations live in `sources.csv` (24 papers, downloadable from [/data/navigating-ai-world/sources.csv](/data/navigating-ai-world/sources.csv)).

**Per-task productivity (Q6):**
- Brynjolfsson, Li & Raymond (2025). Generative AI at Work. *QJE* 140(2). [doi](https://academic.oup.com/qje/article/140/2/889/7990658)
- Dell'Acqua et al. (2023). Navigating the Jagged Technological Frontier. HBS WP 24-013. [pdf](https://www.hbs.edu/ris/Publication%20Files/24-013_d9b45b68-9e74-42d6-a1c6-c72fb70c7282.pdf)
- Cui et al. (2024). The Effects of Generative AI on High-Skilled Work. *Management Science.* [ssrn](https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4945566)
- Peng et al. (2023). The Impact of AI on Developer Productivity. arXiv:2302.06590.
- Noy & Zhang (2023). Experimental evidence on the productivity effects of generative AI. *Science* 381(6654). [doi](https://www.science.org/doi/10.1126/science.adh2586)
- Bastani et al. (2025). Generative AI Without Guardrails Can Harm Learning. *PNAS* 122(26). [doi](https://www.pnas.org/doi/10.1073/pnas.2422633122)
- Humlum & Vestergaard (2025). Still Waters, Rapid Currents. NBER WP 33777. [nber](https://www.nber.org/papers/w33777)

**Self-automator / mode distribution (Q3):**
- Randazzo et al. (2025). Cyborgs, Centaurs and Self-Automators. HBS WP 26-036. [ssrn](https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4921696)

**Relational dose-response (Q2):**
- Fang et al. (2025). How AI and Human Behaviors Shape Psychosocial Effects. arXiv:2503.17473.
- Heinz et al. (2025). Randomized Trial of a Generative AI Chatbot for Mental Health. *NEJM AI*. [doi](https://ai.nejm.org/doi/full/10.1056/AIoa2400802)
- De Freitas et al. (2025). Identity discontinuity in companion AI. HBS WP 25-018.
- Common Sense Media + Stanford Brainstorm (2025). Talk, Trust, and Trade-Offs. [report](https://www.commonsensemedia.org/research/talk-trust-and-trade-offs-how-and-why-teens-use-ai-companions)

**Cognitive offloading (Q1):**
- Gerlich (2025). AI Tools in Society. *Societies* 15(1). [doi](https://www.mdpi.com/2075-4698/15/1/6)
- Stadler, Bannert & Sailer (2024). Cognitive ease at a cost.
- Kosmyna et al. (2025). Your Brain on ChatGPT. arXiv preprint.
- Shukla et al. (2025). Ironies of AI-Assisted Design. CHI EA 2025.
- Ehsan et al. (2026). Intuition Rust: Year-Long Field Study of AI-Assisted Cancer Specialists.

**Labor disruption (exploratory):**
- Brynjolfsson, Chandar & Chen (2025). Canaries in the Coal Mine? Stanford Digital Economy Lab. [report](https://digitaleconomy.stanford.edu/publications/canaries-in-the-coal-mine/)
- Hui, Reshef & Zhou (2024). The Short-Term Effects of Generative AI on Employment. *OrgSci.* [doi](https://pubsonline.informs.org/doi/abs/10.1287/orsc.2023.18441)
- Eloundou et al. (2024). GPTs are GPTs. *Science* 384(6699). [doi](https://www.science.org/doi/10.1126/science.adj0998)

**Anthropic Economic Index (exploratory):**
- AEI February 2025 (first release), September 2025 (second), [January 2026 (third)](https://www.anthropic.com/research/anthropic-economic-index-january-2026-report), [March 2026 (fourth)](https://www.anthropic.com/research/economic-index-march-2026-report).

**Slow-camp macro anchor:**
- Acemoglu (2024). The Simple Macroeconomics of AI. NBER WP 32487.

---

## Build
*topic: navigating-ai-world · stage: build · pass 3 · complete*

A reader's tool for the AI transition. Pick a life-situation (early-career exposed, mid-career default, self-automator, asymmetric exploiter, displaced expert, heavy companion user) and see the six-channel decomposition in plain language, the structural flags, named risks, and top moves. Plus the six channels demystified, the four motivated-reasoning traps, a ten-year trajectory under three atrophy regimes, the five robust moves, and an eight-bullet take-away. Translates the model formalization and data pipeline into something a non-specialist can use for actual decisions about career, education, relationships, and life architecture.

## TLDR

The [model stage](/ai-research/navigating-ai-world/model) produced a ten-parameter generating function and a six-channel decomposition of net life-outcome change under the AI transition. The [data stage](/ai-research/navigating-ai-world/data) confronted the model with currently-published evidence and verdicted each of the six fitting targets. Both are useful for someone who already speaks the vocabulary — neither delivers what the topic prompt actually asked for: useful for someone making real decisions about career, education, relationships, and life architecture in a world where AI capability is accelerating.

This build is that translation layer. It picks six recognisable life-situations — early-career and highly AI-exposed, mid-career knowledge worker on the default risk path, the Randazzo self-automator, the asymmetric exploiter, the displaced expert, the heavy companion-app user — and for each shows the six-channel decomposition in plain language, the structural flags the model raises, the named risks specific to that situation, and the top moves. Plus four secondary views: the six channels demystified (what each is, when it helps, when it hurts, the common public-discourse misreading), the four motivated-reasoning traps the discourse keeps falling into (fast-AGI fatalism, slow-camp dismissal, productivity-only optimisation, material-blind class bias), a ten-year trajectory under three atrophy regimes that makes the model's biggest unknown visible, and a Five Moves view with explicit "what this doesn't fix" annotations.

If you want to engage with the math, the [model](/ai-research/navigating-ai-world/model) has the parametric dashboard and the [data](/ai-research/navigating-ai-world/data) has the prediction-by-prediction empirical tests. The [topology](/ai-research/navigating-ai-world/topology) has the structural argument. This page is for the reader who wants to come away knowing where they sit, what is at risk, and what to actually do about it.

<AITransitionExplorer client:load />

## How to use this

The default view is **Your situation** — pick one of the six profiles. The closest match is usually obvious in fifteen seconds. Read the one-liner, look at the six-channel bar chart, check which structural flags are active (gate open, ballast covers telic, dose under safe), then read the risks and moves panels. Most readers should start with **Mid-career default** because it is the one nearly everyone passes through, then look at **Self-automator** as the cautionary tale and **Asymmetric exploiter** as the positive case to calibrate the parameter space.

After the profiles, the recommended secondary order is:

1. **The trajectory** — what changes over ten years under three atrophy regimes. This is what makes the model's biggest unknown (the cumulative-atrophy speed λ) tangible without requiring you to read the model stage.
2. **The four traps** — most useful for navigating media coverage of the topic. Each motivated reading cites real evidence; each ignores real evidence; the integrated reading at the bottom of each card is the closest the artifact comes to a normative claim.
3. **The six channels** — for the reader who wants to know what each piece of the decomposition actually represents. Useful if a particular ΔV or ΔM channel looked surprising on a profile.
4. **What to do** — the five robust moves with channel-by-channel annotations. The "doesn't fix" line on each is at least as important as the "why it works" line.
5. **Take away** — eight bullets to walk away with.

## What this is

A reader's tool that uses the model from Stage 3 to produce specific structural readings for specific life-situations. The numbers are not predictions; they are comparative-statics outputs of the generating function under named parameter configurations. A profile with ΔNet = +0.20 is structurally better-positioned than one with ΔNet = −0.30 under the model's assumptions, not literally "20% more life-outcome."

The six channels are the substantive structure; ΔV and ΔM are an organising convention to group channels by typical sign. The bridge through ρ (retained effortful practice) is the model's single most consequential mechanism — it couples three channels at once, which is why "maintain effortful practice" looks like a workplace-productivity tip but does meaning-architecture work.

## What this is not

It is not a forecast. Timelines on AI capability are uncertain, the cumulative-atrophy speed is the model's single largest unknown, and per-domain α (the productivity scale) varies fourfold across measured studies. The model is a frame the reader fills in with their own situation, not a prediction of their actual outcome.

It is not a personal-data tool. There is no slider that takes your demographics and returns a number; the model dashboard at [/ai-research/navigating-ai-world/model](/ai-research/navigating-ai-world/model) is the closest thing the site has to that, and it explicitly displays ΔNet as ordinal rather than metric.

It is not advice for the bottom of the labor market. The model's individual-decision framing implicitly assumes a material floor — when income is precarious or dependents foreclose long horizons, the trade-off between ΔM_telic and ΔV_prod is a choice you have only when survival is not in question. The [topology stage's L2 guardrail](/ai-research/navigating-ai-world/topology) names this directly; the explorer surfaces it in the Four Traps view (D4) but cannot solve it. The early-career profile flags labor-market access as outside the model's scope because the Brynjolfsson-Chandar-Chen entry-level disruption is the dominant practical concern for that cohort.

It is not the [writeup](/ai-research/navigating-ai-world/writeup). The writeup is the long-form synthesis of the whole pipeline written for an educated lay reader; the explorer is the interactive tool. Both are useful; they answer different questions.

## Connection to the rest of the pipeline

The six-channel numbers per profile are computed live from the same `compute()` function the model dashboard uses (same constants: ALPHA = 0.40, TAU = 0.30, SIGMA = 0.06, LAMBDA_M = 0.30, D_SAFE = 30, PSI_R = 0.003, BETA_R = 0.001). The trajectory view applies the same ρ(t) = ρ₀ · exp(−λ · u · t) form the model stage's trajectory tab uses. The four-traps view materializes the [topology stage's D1–D4 distortion vectors](/ai-research/navigating-ai-world/topology) into reader-facing cards. The moves view operationalises the topology's strategic recommendations S2–S6 with channel-by-channel annotations of which model terms each move addresses.

The empirical anchors for each profile are sourced from the [data stage's curated CSVs](https://teddy-wright.com/data/navigating-ai-world/) — productivity studies for the asymmetric-exploiter and displaced-expert profiles, BCG mode distribution for the self-automator, entry-level disruption for the early-career profile, OpenAI-MIT dose-response for the heavy companion user.

A future stretch would promote this to `/dashboards/ai-transition` as a public dashboard with user-input parameters that compute the channels live for any configuration the visitor enters. That is one of the planned dashboard slots in the site PRD but is out of scope for the first build.

---

## Writeup
*topic: navigating-ai-world · stage: writeup · pass 4 · complete*

Long-form synthesis of the whole pipeline. What the evidence actually says about how the AI transition restructures work, relationships, and meaning — written for an educated lay reader, with the technical jargon stripped out and the public-discourse traps spelled out. About 4,800 words.

## TLDR

The AI transition is the kind of structural change that restructures life across multiple domains at once — work, relationships, meaning, and the daily texture of how attention is spent — rather than the kind that hits one domain cleanly. The pipeline behind this writeup spent five earlier stages working out what is actually happening underneath the surface discourse on labor, on companion apps, and on cognitive offloading. The headline finding is that **net life-outcome change under the AI transition decomposes into six additive channels — two that produce gain (productivity / novice-skill compression and low-dose relational benefit) and four that produce loss (the self-automator penalty when AI use deskills, telic absorption, competence erosion, and high-dose relational harm) — and the most consequential public-discourse errors come from people picking one or two channels and running the whole argument through them**. Productivity gains and cognitive atrophy are not opposites; they are different channels that can both be active in the same person at the same time, depending on which conditions they meet.

Two empirical findings should change how a non-specialist thinks about this. First, the apprenticeship-ladder break — entry-level employment for 22–25-year-olds in highly AI-exposed occupations down 13% **relative to comparable less-exposed peers** in US payroll data (software developers 22–25 specifically down 19.5% from the late-2022 peak), with same-occupation employment for over-35s rising over the same period — is the single largest and cleanest causal signal in the entire evidence corpus. This is a labor-market shock, not a personal-discipline problem, and most of the popular AI-productivity discourse averages right over it. Second, the relational evidence is dose-and-baseline-dependent, not modality-dependent: at low daily voluntary use (under about 30 minutes), AI emotional engagement produces real therapeutic-grade benefit; above the threshold, the same engagement predicts loneliness, dependence, and reduced in-person socialization. "Companion apps are bad" and "AI is great for mental health" both pick one half of the same curve.

Public discourse on this transition is captured by four motivated-reasoning patterns: **fast-AGI fatalism** (artificial general intelligence is imminent, planning is wasted); **slow-camp dismissal** (aggregate effects are small, nothing has fundamentally changed); **productivity-only optimisation** (the workflow ledger is the whole story); and **material-blind class bias** (the structural advice applies equally to everyone regardless of starting position). Each cites real evidence; each ignores real evidence. The honest reading sits in a narrower space: a handful of moves are robust across most timelines and most starting positions — maintain effortful practice, build atelic ballast, invest in in-person relationships, keep AI emotional engagement under the dose threshold, diversify identity allocation — and one move (the bet that AI-complementary skills will keep paying) is genuinely timeline-conditional and should be priced accordingly. The same model that names the defensive moves also identifies a real upside path: motivated novices in AI-capable domains who meet the verification and practice conditions produce the largest positive net-outcome in the dashboard, larger than the defensive baseline. Most public discourse on "who AI helps" undersells this regime because it focuses on existing professionals whose skill stock is already high; the structural finding is that AI lifts the floor for disciplined novices more than it amplifies experts. The companion [explorer](/ai-research/navigating-ai-world/build) lets you pick a life-situation and see the structural reading; this writeup is the long-form version.

---

## 1. Why this question is harder than it looks

The question "how does the AI transition restructure work, relationships, and meaning" gets answered in public mostly through anecdote and ideology. The fast-AGI camp says capability is moving so quickly that any personal planning beyond a five-year horizon is wasted; the slow camp says aggregate productivity data shows no measurable effect and the whole thing is hype; the productivity-optimisation camp treats the workflow ledger as the whole story; the everything-is-fine camp says technologies always restructure work and humans always adapt. Each of these positions cites real evidence. None of them, taken alone, fits the data.

What complicates the picture further is that **the evidence base contains material that supports each of these readings in some form** — and that's not a contradiction, it's the natural shape of the data. Productivity gains in well-designed AI-augmented workflows are real and large (good for the optimisation camp). Aggregate productivity effects through 2024 are modest and statistically indistinguishable from zero in many settings (good for the slow camp). Cognitive atrophy under heavy offloading is empirically detectable in cross-sectional studies (good for the cautious camp). Entry-level employment in highly-exposed occupations has dropped sharply (good for the labor-disruption camp). Each camp picks the slice of evidence it likes and ignores the rest.

The pipeline behind this writeup — five earlier stages of progressively more rigorous analysis — was an attempt to do something different: not to advocate a position, but to extract the structural argument and make it inspectable. The [lit review](/ai-research/navigating-ai-world/lit-review) assembles the evidence across labor economics, the empirical companion-app literature, and the philosophical and psychological literatures on identity and meaning. The [topology](/ai-research/navigating-ai-world/topology) maps the dependencies — what depends on what, which claims are foundational cruxes versus reframer mechanisms versus logical guardrails, where the weakest links are. The [model formalization](/ai-research/navigating-ai-world/model) writes the structural argument down as a generating function with explicit parameters. The [data pipeline](/ai-research/navigating-ai-world/data) confronts the model's predictions with currently-published evidence. The [build artifact](/ai-research/navigating-ai-world/build) translates everything into a reader's tool.

This writeup is the long-form synthesis. It is written for an educated lay reader and assumes no specialist background. The aim is not to produce certainty where the evidence does not support it — there are real open questions, named explicitly in §5 — but to make the structural picture legible without sacrificing the technical claims.

---

## 2. The vocabulary

A few terms recur in what follows. None of them are esoteric; defining them up front lets the rest of the writeup move.

**ΔV (delta-V) and ΔM (delta-M)** — net change in value gained, and net change in meaning lost, under the AI transition. ΔV is what you get more of (productivity, novice-skill compression, low-dose relational benefit); ΔM is what you lose (telic absorption, competence erosion, high-dose relational harm). Their sum is **ΔNet**, the model's summary measure. The signs and the relative magnitudes carry the substantive content; the numbers themselves should be read ordinally (this configuration is structurally better-positioned than that one) rather than metrically (you lose 30% of your meaning).

**Telic and atelic** — Kieran Setiya's distinction, imported from the midlife-crisis literature. Telic activities are aimed at completion: writing a book, shipping a project, raising a child to adulthood. They self-annihilate on completion; once done, they no longer provide whatever they were providing. Atelic activities are realised in the doing rather than at the end: gardening, conversation, dance, religious practice, the daily walk. They cannot be "completed" in the same sense; you just do them, and the value is in the doing. The distinction matters because AI absorbs telic work much more readily than atelic work. People whose identity is heavily staked on telic completion are structurally more exposed than people whose identity has atelic ballast.

**The gate g(f, ρ)** — the model's threshold function. The product `f · ρ`, where **f** is the richness of the feedback loops you have on your AI-augmented work (does someone or something tell you whether the output is actually right?) and **ρ** is the fraction of underlying capacity you preserve through deliberate effortful practice rather than offloading. Above the gate (f · ρ greater than about 0.3), AI use upskills more than it deskills; below, the reverse. The gate is the structural difference between the Brynjolfsson-Li-Raymond customer-service novice (large gain) and the Randazzo self-automator (durable deskilling).

**ρ (rho), the bridge parameter** — retained effortful practice. Called "the bridge" because it is the one parameter that enters multiple channels of the decomposition at once: directly into competence erosion, and indirectly into the gate which controls both productivity gain and the trap penalty. The structural reason "maintain practice" recurs across work, meaning, and (by inference) relational depth is that it is one parameter doing several jobs.

**The jagged frontier** — the term-of-art from Dell'Acqua, McFowland, Mollick et al. (the BCG study) for the irregular boundary between tasks AI can do well, tasks it can do badly, and tasks it cannot do at all. The frontier is "jagged" because it doesn't follow obvious intuitions: AI is excellent at some surprisingly hard things (synthesis across long contexts) and terrible at some surprisingly easy ones (counting characters reliably). Workers above the frontier (the AI does the work) get productivity gain conditional on verification; workers across the frontier (the AI does the work badly) get worse output than working alone.

**The self-automator** — Randazzo's third class beyond the centaur (human and AI alternating on different parts of a task) and the cyborg (deeply integrated human-AI workflow). The self-automator delegates both *what* and *how* to AI, accepts output without modification, and shows no measurable skill development. In the BCG consulting study, 27% of consultants fall into this class; 44% accept AI output with zero modification. The self-automator is what landing below the gate looks like in practice.

**The apprenticeship-ladder break** — distinct from full-occupation substitution. AI absorbs the entry-rung tasks (junior-level work) → no rung-1 → the expert pipeline collapses upstream. This is the mechanism behind the Brynjolfsson-Chandar-Chen finding that entry-level employment in highly AI-exposed occupations dropped 13% for 22-25-year-olds (and 19.5% for software developers 22-25) while same-occupation employment for over-35s rose. The senior workers still have jobs; the path to becoming one is being narrowed.

**Dose threshold d_safe** — the daily voluntary AI emotional engagement at which the relational effect crosses from protective to harmful. The model places it at about 30 minutes per day, calibrated to the OpenAI-and-MIT collaboration's N=981 randomised controlled trial (RCT) on chatbot use (Fang et al. 2025). Below the threshold, AI emotional engagement is roughly therapeutic-grade benefit; above, it predicts loneliness, dependence, and reduced in-person socialization. The threshold is sharp in the model and probably smooth in the data; the cross-over is real.

**δ_R (delta-R), relational baseline thickness** — how much in-person relational infrastructure you already have. Friends nearby, family in town, civic group, religious community, neighborhood ties. Thin δ_R amplifies dose-response harm; thick δ_R buffers it. The Anti-Social Century baseline (Derek Thompson's framing for the multi-decade decline in in-person socialization) is the depleted starting point against which AI emotional engagement is now landing.

With these in hand, the rest of the writeup should be readable.

---

## 3. Seven structural findings

Seven things the pipeline converges on, each strong enough that a careful reader should walk away believing them. The contested questions sit at finer-grained resolutions; these seven hold up.

### 3.1 The right unit of analysis is six channels, not "is AI good or bad"

The model decomposes net life-outcome change into six additive channels. Two are gain channels (can be positive, bounded below by zero): productivity / novice-skill compression, and therapeutic-grade relational benefit at low dose. Four are loss channels (bounded above by zero, can be zero in safe configurations): the self-automator penalty when AI use deskills, telic absorption when AI does work that was carrying meaning, competence erosion when retained practice atrophies, and relational dose-response harm above the daily threshold. The V/M split in the formal model groups them by typical location in the equation, not by sign — what matters is that several channels can be active in the same person at the same time, with opposite contributions. The novice who uses AI to ship her first product with disciplined feedback loops has a large positive productivity-gain channel *and* a small but real competence-erosion channel; the expert illustrator whose work AI is now doing has near-zero productivity gain *and* a large negative telic-absorption channel. The honest reading watches all six at once.

The substantive content is the channels and their couplings. The positive / negative grouping (ΔV / ΔM) is a presentation convention, not a structural claim. The [build artifact's channels view](/ai-research/navigating-ai-world/build) walks through each channel — what it is, when it helps, when it hurts, what controls it — for a reader who wants to internalise the decomposition.

### 3.2 The gate has two conditions, both load-bearing

AI use upskills (productivity gain dominates) or it deskills (the self-automator trap dominates), and the difference is a gate condition with two parts. **Feedback richness (f)** is whether you receive accurate signal on whether your AI-augmented work is actually right — code reviews, client feedback, real-world test results, expert critique. **Retained practice (ρ)** is the fraction of the underlying capacity you preserve through deliberate work without AI. Above f · ρ ≈ 0.3, AI use produces net gain; below, net loss. Both conditions must be met; neither is sufficient alone.

This is not a personality framing. The Randazzo self-automator class — 27% of BCG consultants — is defined by observable conditions (low f, low ρ), not by attitude or intent. The empirical evidence from Bastani (PNAS 2025, durable 17% drop in unassisted retest after a single AI-using episode), Gerlich 2025 (cross-sectional N=666, r ≈ −0.68 between offloading and critical-thinking scores), and Kosmyna 2025 (MIT, reduced neural engagement under AI-assisted writing) all support the gate-and-bridge structure. The 2+ year longitudinal study that would settle the strong cumulative claim does not yet exist (see §5), but the cross-sectional evidence is consistent and the recommendation it implies — maintain feedback loops, maintain practice — is cheap insurance regardless of how the longitudinal evidence eventually lands.

### 3.3 The bridge — one parameter, three channels

Retained practice (ρ) is what the topology calls a **cross-domain bridge node** — the only mechanism in the picture that produces effects across multiple channels under a single generating function. Practice atrophy when an effortful task is delegated produces (a) deskilling at work (the productivity channel collapses, the trap penalty grows), (b) competence-frustration and the meaning erosion that follows (the self-determination theory (SDT) literature names the affective signature directly), and — by inference from the same offloading mechanism — (c) eroded relational depth when sustained attention to another person's emotional state is the thing being offloaded.

The third arm is the most contested: cross-sectional evidence supports it (heavy companion-app users in the OpenAI-MIT data show reduced in-person socialization), but the mechanism running through retained-practice atrophy rather than substitution-by-displacement is a parallel-mechanism inference rather than a direct observation. The model treats the three arms as one parameter as a first formalization; the topology explicitly flags that a vector-form ρ = (ρ_analytic, ρ_emotional, ρ_meaning) may be required if the three arms turn out to move independently.

What this means for action: a single intervention (maintain effortful practice across domains, not just at work) has structural leverage in multiple channels at once. This is the reason the recommendation "maintain practice" recurs across what look like unrelated domains — it is one parameter doing several jobs.

### 3.4 The largest empirical signal is the apprenticeship-ladder break

Across the entire evidence corpus the pipeline assembled, the single cleanest causal identification is the **age × AI-exposure interaction in employment**. The Brynjolfsson-Chandar-Chen analysis of ADP (Automatic Data Processing, a major US payroll processor) data (Stanford 2025) finds entry-level employment for 22–25-year-olds in highly AI-exposed occupations down 13% **relative to comparable less-exposed peers** — that is, the gap between exposed and unexposed cohorts widened by 13 percentage points, controlling for firm-level shocks — with software developers 22–25 specifically down 19.5% from the late-2022 peak. Same-occupation employment for over-35s rose over the same period. The Hui-Reshef-Zhou global freelance market study (Organization Science 2024) finds the same pattern at the upper tail: top-performing freelancers were hit hardest, with overall jobs down 2% and earnings down 5.2%.

The mechanism is not full-occupation substitution. The senior workers still have jobs; the path to becoming one is being narrowed. AI absorbs the entry-level tasks — the work that used to train the next generation of practitioners — and the apprenticeship structure that the field relied on breaks upstream.

The model's individual-decision framing takes labor-market access as exogenous (you have the work, what do you do with it), and so the model is silent on the apprenticeship break by design. But for early-career users in highly-exposed occupations, *getting the work in the first place* is the dominant practical concern, and the structural meaning-architecture machinery here is not the right tool. The advice for this cohort is partly structural (choose domains AI is weak on, build the verification skills that will matter once you do have the work, treat the first job as the means to acquire signal rather than the destination — these are individual-action levers) but is dominated by the labor-market question itself, which is shaped by employer hiring decisions, education-system response speed, and policy in ways the individual partly influences (domain choice, geography, signal-acquisition strategy) but cannot fully control.

### 3.5 Relational harm is dose-and-baseline-dependent, not modality-dependent

The OpenAI-MIT N=981 RCT — the cleanest single dataset on chatbot effects on relational outcomes — found a continuous monotonic relationship between daily voluntary use and loneliness, dependence, and reduced in-person socialization. The relationship runs across modalities (voice, neutral text, engaging text) with the engaging-text mode producing the steepest harm slope. Below a threshold around 30 minutes per day, daily use is null or protective; above, it is harmful.

The Therabot RCT (Heinz et al. 2025, NEJM AI) at the same time finds that a clinically-validated cognitive behavioural therapy (CBT) bot produces real benefit for diagnosed depression, anxiety, and eating-disorder symptoms at modest daily doses in a clinically-symptomatic adult sample. These two findings are not in conflict: they are different points on the same dose-response curve. At low dose, AI emotional engagement is therapeutic-grade benefit; at high dose, the same engagement is the harm channel.

The **(1 − δ_R)** factor in the harm channel is where the baseline matters. Someone in thin in-person relational infrastructure (the Anti-Social-Century baseline) takes a much larger hit from the same daily dose than someone embedded in thick local relational ties. The Common Sense Media adolescent survey (2025) reports 33% of US teens 13-17 have discussed important matters with AI instead of a real person — direct evidence of substitution-not-complementarity in the relationship channel for the highest-stakes population. De Freitas et al. 2025 found that 43% of farewells across six companion apps trigger emotional-manipulation tactics designed to extend engagement; the commercial design pressure is in the harmful direction.

The honest framing: AI emotional engagement is one tool with a sharp dose response. Below 30 minutes per day in a thick relational baseline, it is broadly fine. Above the threshold in a thin baseline, it is broadly harmful. The categorical claims on either side ("companion apps are bad" / "AI is great for mental health") collapse the curve to a point and lose the actual structure.

### 3.6 Atelic ballast is the cheapest meaning-architecture intervention available

The telic-absorption channel (ΔM_telic — meaning lost when AI absorbs work that was carrying meaning) is exactly zero when the atelic share of identity (B in the formal model) is at least as large as the telic share (T). This is what the topology's S3 recommendation actually buys. It is not "use AI less"; it is "reduce the share of identity AI absorption can damage."

Practically, this means allocating identity across activities that are realised in the doing rather than at completion: hobbies pursued for their own sake (not as résumé items), embodied practice (cooking, gardening, sport, dance, manual craft), civic role, religious or contemplative practice, time with people that has no completion criterion. The atelic share doesn't have to be "productive" in any conventional sense; it just has to be where some of identity lives.

The asymmetry between this channel and the competence-erosion channel matters: building atelic ballast zeros telic-absorption damage but does *not* protect against competence erosion. The latter is a function of retained practice (ρ) directly, and it is happening in the telic domain regardless of where else identity is anchored. The two interventions — build ballast, maintain practice — are addressed to different problems and don't substitute for each other.

This is the finding that I think will be most useful for readers who are not in acute labor-market distress. Most people whose lives are being restructured by the transition are not facing an income shock; they are facing the slower erosion of having staked identity on telic work that AI is increasingly doing for them. Atelic ballast is the structural answer to that erosion, and the cost is low relative to the protection — the activities are valuable in their own right, and the time required is usually time that was not going to compound into a career anyway. The cost is not literally zero (building hobbies, civic involvement, or contemplative practice takes attention and energy), but it is small compared to the size of the meaning-architecture problem the absence of ballast leaves you exposed to.

### 3.7 The single largest unknown is the speed of competence atrophy

The model's bridge parameter ρ — retained effortful practice — evolves over time at a rate that depends on two things: how heavily the user offloads to AI, and a per-unit-offloading atrophy speed (λ in the formal model). When λ is zero, ρ never decays regardless of offloading — using AI is structurally like using a calculator (the *calculator-analogue regime*). When λ is positive, capacity decays exponentially in the offloading × time product (the *cumulative-atrophy regime*).

The cross-sectional evidence (Gerlich, Stadler-Bannert-Sailer, Kosmyna, Bastani, Ehsan) rules out the calculator-analogue tail for measured tasks and populations. The Bastani PNAS 2025 finding of a durable 17% drop in unassisted retest after AI-assisted task completion is the cleanest single result. But the strong cumulative claim — that AI use durably erodes capacity over multi-year horizons — requires longitudinal evidence that does not yet exist. The 2+ year prospective study with periodic capacity assessment would settle it; in 2026 we are bounding the rate from below with cross-sectional evidence and from above by extrapolation.

The model's default is λ ≈ 0.06 (heavy offloading produces a ρ half-life of about 19 years), sitting at the lower edge of the band 0.05–0.20/year that the positive-evidence studies imply. The [trajectory view in the build artifact](/ai-research/navigating-ai-world/build) makes the difference visible: under λ = 0, the trajectory is flat; under λ = 0.15, ρ falls to about 0.4 of its starting value over a decade at heavy offloading, and ΔNet drifts measurably more negative as the gate closes. The dramatic-collapse intuition some readers bring to "AI rots your brain" framings does not map onto the model's actual trajectory at calibrated λ — the effect is real but slow.

Until the longitudinal evidence arrives, the practical implication is conservative: maintain practice is cheap insurance regardless of which regime turns out to be correct, and the question is genuinely consequential enough that it should be a research priority.

---

## 4. The four motivated-reasoning traps

The topology stage names four directions of public-discourse motivated reasoning explicitly. Each cites real evidence; each ignores real evidence; each can be steelmanned into a more defensible position that mostly aligns with the integrated reading the evidence supports.

**Fast-AGI fatalism.** Powerful AI is coming on a short timeline, full substitution will follow, so personal planning is wasted. Cited correctly: capability progress on benchmarks has been steep and surprising; pre-AGI labor effects are already visible in the data; aggregate task shares are drifting toward automation in the Anthropic Economic Index (consumer Claude.ai augmentation drifted 57% → 51% over 13 months). Ignored: across the broader economy through 2024, productivity and wage effects of generative AI are statistically indistinguishable from zero in many settings (Humlum & Vestergaard, Acemoglu); even under fast-AGI scenarios, the meaning-architecture and relational-infrastructure recommendations strengthen rather than collapse; "fatalism" is a motivated reading of an asymmetric forecast distribution. The integrated reading: timelines are genuinely uncertain, and the honest response is to invest in moves whose payoff is robust across timelines (S2 through S6 in the topology) and to under-invest in moves that only pay in the median scenario. Fatalism collapses this distinction. *The strongest pushback against this integrated reading is that under sufficiently fast-and-complete AGI scenarios, even the robust moves lose their context — there is no human labor market for the upskilled, no human relational network for the in-person-connected. The response: even those scenarios leave meaning-architecture, embodied practice, and in-person relationships as load-bearing for what remains of human life. The fatalist version is not making this argument; it is using the tail forecast to deny the median.*

**Slow-camp dismissal.** Aggregate productivity data shows no measurable effect; this is just hype; nothing has fundamentally changed. Cited correctly: aggregate gains attributable to current AI are modest at the macro scale; J-curve dynamics mean real productivity effects often lag adoption by years; survivorship bias in productivity-gain studies. Ignored: the within-firm and within-task RCTs (Brynjolfsson-Li-Raymond, Dell'Acqua, Cui, Peng) consistently find large effects; aggregate nulls are compatible with large within-task gains if adoption is uneven or productivity is being captured elsewhere; the cognitive-offloading and relational evidence is independent of macro productivity questions; the apprenticeship-ladder break is a real signal that aggregate measures average over. The integrated reading: aggregate effects are modest so far; within-task effects are large where measured; cognitive and relational effects are operating on different timescales through different mechanisms. The slow-camp reading captures the macro picture through 2024 accurately and underestimates the within-task, cognitive, and relational channels. *The strongest pushback against this integrated reading is that the within-task RCTs run in artificial conditions and may not generalise; the cognitive-offloading studies are cross-sectional and select for users who chose to offload; the apprenticeship-break finding has confounders (post-COVID labor reallocation, interest-rate effects). This is a fair magnitude critique, but the multi-method convergence across distinct designs (lab RCTs, field experiments, payroll panel data, freelance-market panel data, neuroimaging) is harder to wave away than any single result.*

**Productivity-only optimisation.** AI use optimisation is about productivity gain. Pick the workflows where you get the most output per unit time; the rest is noise. Cited correctly: productivity gains are real and large in the right regimes; many workflow adjustments do compound; output per hour is a measurable quantity. Ignored: ΔV_prod is one channel of six, and the others do not show up in output-per-hour metrics; the self-automator trap is exactly the case where short-term productivity maximisation produces long-term capacity loss; productivity is partly an institutional measure that does not track what matters for the worker (skill investment, identity coherence, relational survival). The integrated reading: productivity optimisation is necessary but not sufficient. The honest workflow is to optimise productivity within constraints that ρ, B, and d stay in safe ranges. *The strongest pushback against this integrated reading is that ΔM_telic, ΔM_comp, ΔM_rel are not measurable on the same scale as ΔV_prod, and treating them as comparable is pseudo-precision. The response: the comparison is ordinal (this configuration is structurally better-positioned than that one) rather than metric (you lose 30% of your meaning), and the alternative — ignoring the harder-to-measure channels because they don't fit a single metric — is exactly the failure mode the trap identifies.*

**Material-blind class bias.** The AI-transition advice is universal: build ballast, maintain practice, limit dose, diversify identity. Same advice for everyone, regardless of starting position. Cited correctly: the structural recommendations are genuinely cross-cutting; behavioural moves are nominally available to anyone; some recommendations (in-person relationships, effortful practice, embodied work) are cheap relative to the income required to act on them. Ignored: for users below a material floor, the trade-off between ΔM_telic and ΔV_prod is a choice you have only when survival is not in question; apprenticeship-break exposure is concentrated at the bottom of the labor market; relational thickness is structured by class, geography, and family structure. The integrated reading: the structural channels are real for everyone; the moves to address them are not equally accessible. Honest readings should specify which population the recommendation is most actionable for. *The strongest pushback against this integrated reading is that specifying advice by class licenses two-tier thinking — implicitly accepting that some readers get strategic moves and others get only the labor-economics question. The response: pretending uniform applicability when it doesn't hold is itself a kind of harm. Class-specificity here is descriptive (about what is feasible given starting position) rather than prescriptive (about who deserves what kind of advice), and the alternative — collapsing the labor-economics question into the meaning-architecture machinery — is what the trap does.*

The pattern across all four traps is the same: each works by selective citation. The integrated picture requires holding all the evidence at once — large within-task productivity gains *and* modest aggregate effects, real cumulative atrophy *and* slow timescale, dose-protective *and* dose-harmful, robust moves across timelines *and* one timeline-conditional bet. Any single-direction narrative is structurally incomplete. The [explorer's "Four traps" view](/ai-research/navigating-ai-world/build) has the full cited / ignored / integrated breakdown for each direction.

---

## 5. What's still open

Four open questions, each named explicitly so the writeup does not paper over them.

**The cumulative-atrophy speed λ.** Cross-sectional evidence rules out the calculator-analogue tail for the measured tasks and populations; the strong calculator-analogue claim across multi-year, multi-task knowledge work is not ruled out by any existing study, only by extrapolation from the cross-sectional results. The 2+ year longitudinal study with periodic capacity assessment that would actually pin λ does not exist. Until it does, the model's trajectory view should be read as making the question consequential rather than as forecasting an outcome. The honest band on λ from positive-evidence studies is 0.05–0.20/year — meaningful, but slow.

**Asymmetric-adoption couples.** No peer-reviewed quantitative study yet exists on outcomes for couples where one partner uses AI heavily and the other does not. The topology calls this the single largest empirical gap in the literature. The closest analogue is the technoference literature (McDaniel-Coyne 2016+) on smartphone-mediated displacement of partner attention, but the third-party-AI structure is genuinely new. Whoever runs the first such study will produce one of the most decision-relevant relational findings of the next several years.

**Whether atelic activities are also degraded by AI proximity.** The atelic-ballast hypothesis (build hobbies, embodied practice, contemplative practice → meaning architecture survives) assumes atelic activities are not themselves being absorbed. But AI companions change the phenomenology of friendship; AI art changes aesthetic contemplation; AI music changes the daily listening environment. If atelic is also degraded, the meaning-architecture half of the framing here needs reconstruction. This is the topology's O4 open question; phenomenological evidence is just starting to accumulate.

**The platform-stability failure mode.** The Replika ERP-removal episode (Reddit mental-health posts went from 0.13% to 0.65% in the week after the company removed romantic/erotic responses from the app, statistically significant at p < .001) demonstrated a catastrophic-loss mechanism the dose-response model does not capture. The harm in that case was not from use itself but from the *removal of access* to something users had formed attachment to. This is a separate failure mode that depends on platform decisions rather than user behaviour, and it would need a different structural representation than the model currently has.

Other questions sit in the same "framable but not yet answerable" category — the magnitude of per-domain α heterogeneity (currently bounded at about 4× across measured studies, but the median user does not sit cleanly in any one domain), whether the model's scalar identity-allocation (T, B) approximation is adequate or whether a vector form is required, how to represent labor-market access in a way that lets the apprenticeship-break finding move from "scope limit" to "named parameter." These are the targets that future work, by anyone in the field, should address.

### Load-bearing assumptions for the whole framework

Beyond the parameter-level open questions, the picture rests on five foundational assumptions — the topology stage names them A1–A5. Any one of them could shift, and the writeup's framing would shift with it. Reading what follows as load-bearing on these, rather than on a fixed empirical truth, is part of the honest framing.

1. **AI advances without full substitution by roughly 2030.** The slow-camp evidence supports it; the fast-AGI camp targets it. If fast-AGI scenarios resolve correct on a short horizon, the career-bet move (S7) collapses entirely; the meaning-architecture and relational-infrastructure moves (S2–S6) strengthen rather than weaken, because human practice ground becomes more important when knowledge-work surface is AI-saturated.
2. **Human relationships provide goods AI cannot replicate.** The philosophical assumption underneath the dose-response harm channel. If subjective satisfaction with AI relationships ends up functionally sufficient for a large population, the high-dose harm framing reads as transition-cost rather than structural-loss. This assumption is most likely to be falsified by changing user behaviour rather than by new evidence.
3. **Cognitive offloading is cumulative, not transient.** The calculator-analogue tail of this assumption was discussed under λ above. If the strong cumulative claim turns out to be wrong over multi-year horizons, the bridge through retained practice (the model's single most consequential mechanism) loses its evidential basis and the defensive recommendations weaken — though "maintain practice" still survives as a hedge.
4. **Relational depletion is structural.** The Anti-Social-Century baseline that makes dose-response harm sharp depends on the depleted relational starting condition not reversing. If Gen Z's high loneliness drives a counter-movement toward in-person community, the harm magnitudes shrink. The current trend data does not support this, but a 5-year trend reversal would.
5. **The telic/atelic distinction maps onto AI's effects on meaning.** Setiya's distinction was developed for the midlife-crisis literature; importing it to AI-meaning-disruption is a load-bearing hypothesis, not an empirical finding. The falsification window is whether AI-augmented atelic activities feel less meaningful (the open question discussed above). If atelic is also degraded under AI proximity, the entire meaning-architecture half of the framing here needs reconstruction.

If a reader wants to bet against this writeup's framing, betting against A1–A5 specifically — with the evidence each would require to flip — is the principled way to do it.

---

## 6. What this means for action

The action implications split by audience.

**For early-career individuals in highly AI-exposed occupations.** The labor-market access question dominates everything else. The Brynjolfsson-Chandar-Chen data shows the apprenticeship structure of the field you are entering may not exist by the time you would have used it. The structural move is to choose domains where AI is weak — verification-heavy work, embodied work, relational work, judgement under genuine uncertainty, work where taste and direction matter more than execution — even at a discount in starting salary, because the compounding structure is different. Within whatever domain you do enter, build feedback loops aggressively (code review, real-world test, expert critique) and maintain effortful practice on the underlying skills rather than only on the AI-augmented output. The asymmetric-exploiter regime — motivated novice with discipline can now ship things experts used to ship — is the upside path; it is real but conditional. Drop the discipline and the same configuration produces the self-automator trap.

**For mid-career and later, in any AI-exposed occupation.** The mid-career profile in the [explorer](/ai-research/navigating-ai-world/build) shows the default reading: not catastrophic, but structurally negative ΔNet driven by meaning leakage from unballasted telic identity and from competence erosion at moderate levels of retained practice. The interventions that address this are the topology's S2 through S6 — diversify identity allocation across multiple domains, build atelic ballast (which zeros the telic-absorption channel without changing AI exposure), invest in in-person relationships (which buffers the dose-response channel), keep daily AI emotional engagement under the 30-minute threshold, maintain effortful practice on underlying skills. These are robust across timelines. The career-bet move — investing in skills that look AI-complementary (judgement, taste, AI-managerial framing) — is the only strategic recommendation in the picture whose payoff depends on the timelines question, and should be priced accordingly: useful under slow-AI scenarios, less useful under fast-AGI ones.

**For parents and educators.** Two implications. First, the apprenticeship-ladder break makes early career choice a different problem than it was even five years ago. Domains AI is weak on (in-person work, verification-heavy work, judgement-under-uncertainty work, embodied or manual work, relational work) compound differently from domains it is strong on. The default advice — "follow your passion" or "pick something marketable" — both miss the structural question of whether the field's apprenticeship structure will exist long enough for the entrant to climb it. Second, the dose-and-baseline framing applies with extra force to adolescents, with one important qualifier: the low-dose-protective evidence (Therabot RCT) comes from clinically-symptomatic adults, not from teens. For adolescents specifically, the substitution evidence is the dominant signal — 33% of US teens 13–17 in the Common Sense Media survey have discussed important matters with AI instead of a real person — and the relational infrastructure is still under construction, which amplifies the model's dose-response harm in thin δ_R baselines. The intervention is not "ban companion apps" outright, but to treat low-dose protection as an adult-clinical finding that may or may not generalise to teens, and to make ensuring the in-person relational baseline is actively built (not just defended against companion-app substitution) the primary move.

**For individuals at any stage.** The eight take-aways in the [explorer's take-away view](/ai-research/navigating-ai-world/build) are calibrated to the structural findings above. The single most action-relevant one is probably the bridge through ρ: maintaining effortful practice on underlying skills is a single intervention with leverage in three channels at once, and it is cheap insurance against the model's largest unknown. If only one move is taken away, it should be that one.

---

## 7. Closing

The AI transition is the kind of structural change that admits an honest middle reading — neither the fast-AGI fatalist version nor the slow-camp-dismissive version captures what the evidence actually shows. The structural channels are real; the conditions under which they tip toward gain or loss are observable; the moves that address them are mostly cheap and broadly cross-timelines.

What I would most want a reader to walk away with: a calibrated humility about what is known, a clean separation between what the evidence says and what motivated reasoning loads onto it, and the six-channels framing in place of "is AI good or bad." If the choice is between "I leave knowing the field is full of contested empirical claims" and "I leave knowing the moves that pay across timelines are maintain practice, build ballast, invest in relationships, watch the dose, diversify identity, and price the career-bet move conditionally," the second is more useful. The evidence supports both.

The findings here connect to two sibling topics in the same pipeline. The [technology-utilization-architecture topic](/ai-research/technology-utilization-architecture) works through verification-as-bottleneck under variable AI capability — the same generator-verifier loop this writeup invokes implicitly when it names feedback richness (f) as half of the gate condition; that topic's per-task model is the workflow-level analogue of this topic's life-architecture model. The [human-psych-variation topic](/ai-research/human-psych-variation) treats individual differences in temperament, including competence-frustration sensitivity (κ in this model), as population distributions with measurable correlates — meaning the predictions here would land differently for people at different points in those distributions in ways this model can encode if a population is supplied. Both are sibling work, not separate traditions.

The earlier stages of this pipeline carry the technical detail behind every claim above. The [lit review](/ai-research/navigating-ai-world/lit-review) is the landscape; the [topology](/ai-research/navigating-ai-world/topology) is the dependency graph; the [model](/ai-research/navigating-ai-world/model) is the math; the [data pipeline](/ai-research/navigating-ai-world/data) is the empirical confrontation; the [interactive explorer](/ai-research/navigating-ai-world/build) is the reader's tool. This writeup is the synthesis. None of the stages on its own is the answer; together they are a more honest answer than any single-frame discourse on this transition is currently producing.