Topology
Dependency graph of the lit review. Three categories of high-stakes node (foundational cruxes / reframer nodes / logical guardrails) plus weakest links, four variant views, three Stage-3 options, an objections section, and a glossary. Updated through 2024-2025 literature on AM correction, within-family GWAS at scale, PGS portability, Scarr-Rowe collapse, GEP replication, and missing-heritability closure.
TLDR
The lit review documents what the science says about psychological variation. This topology asks a sharper question: what depends on what? Strip the field down to its load-bearing structure and the picture is surprisingly clean. Three foundational assumptions — that twin/adoption methods give approximately valid variance decomposition (A1), that GWAS signal reflects real genetic effects rather than population structure or assortative-mating artifact (A2), and that a general factor of cognitive ability g exists as a real dimension of individual variation (A3) — sit upstream of most of the empirical and synthesis nodes in the graph; if any one of them flipped, large regions would have to be rebuilt. Everything else is either an empirical claim resting on these foundations, a methodological prerequisite that lets the foundations be tested, a logical necessity that constrains how the empirical claims can be interpreted, or a generating mechanism that explains why the empirical pattern looks the way it does.
The high-stakes nodes split into three categories — keeping them separate is the single most useful conceptual move in this topology. Foundational cruxes (A1 twin validity, A2 GWAS signal real, A3 g exists) are the assumptions that, if falsified, force rebuilding regions of the picture. Reframer nodes (G2/E6 passive rGE / genetic nurture; G6/E7 cross-trait assortative mating) don’t break the picture if reversed — they change what it means; their magnitudes are being actively quantified and their precise share of population-level “genetic” effects is the field’s most consequential open quantity. Logical guardrails (L1 variance-ratio definition; L4 Lewontin firewall) cannot be falsified — they can only be ignored, which is exactly how most public-discourse misuse of the field proceeds. Conflating these three types under a single label of “important findings” is a major source of bad-faith debate.
The field’s weakest links are not where public discourse focuses heat. Mainstream contests over “is heritability real” target settled findings (A1+E1 are robust); the 2025 whole-genome-sequencing work (Wainschtein et al. 2025, Nature) now closes ~88% of the pedigree-based heritability gap, so the “missing heritability” critique is also substantially answered. The actual fragile zones in 2026 are: (a) the generalization from candidate-GxE failure to all-GxE-is-small — partially holding the null (Allegrini 2020 for education, 2025 systematic review for depression) but the literature is still too young for confidence; (b) Scarr-Rowe has weakened further — Ghirardi et al. 2024 found 39/42 PGI×SES interactions in the opposite (compensatory) direction, so “deprivation suppresses heritability” is now evidence-thin; (c) the polygenic-score → mechanism inference (Plomin’s “causal” view vs. Turkheimer’s “weak explanation” view) remains genuinely undecided; (d) the magnitude of AM-correction across psychiatric cross-disorder rg estimates is now being actively addressed — Ma, Wang, Border et al. 2024 (LAVA-Knock) is the first method to systematically reduce xAM-induced bias, with the field-wide answer likely in 2–3 years. The Flynn-reversal cause and the Gender Equality Paradox mechanism remain open mechanistic questions, but they are open in a different way — the empirical patterns themselves are robust; only the explanation is contested. The 2025 GEP systematic review (Herlitz et al.) actually strengthened the pattern across personality, verbal abilities, episodic memory, and negative emotions.
This topology is the input to model formalization (Stage 3). The cleanest formalization target is the variance decomposition equation: V(trait) = V(direct genetic) + V(genetic nurture / indirect) + V(AM-induced LD) + V(shared environment) + V(measured non-shared) + V(stochastic) + 2·Cov(genes, environment) + interaction terms — with each term parameterized by trait, age, and population context, and with the AM and rGE terms being where current methodological revision is concentrated. The four variant views below (Vulnerability / Flow / Minimal / Politicization) read the same graph through different lenses to make the formalization choices easier.
The graph
All ~50 nodes and their dependencies. Click a node for detail; drag to rearrange.
Click a node for its claim and load-bearing weight; hover an edge for the relation type; drag to rearrange. The variant toggles read the same graph through different lenses (vulnerability, flow, minimal claim set, politicization).
How to read this graph
Every node in the lit review collapses to one of eight types. Edges between them carry one of seven relations. Together they make the structure inspectable.
Node types
| Code | Type | What it is |
|---|---|---|
| A | Foundational assumption | A claim the field cannot operate without; if false, large downstream regions collapse |
| M | Methodological prerequisite | A study design or estimation tool that must work for the empirical claims to be testable |
| E | Empirical claim | A specific measured finding with an effect size and replication status |
| L | Logical necessity | Follows from definitions or algebra; not empirically refutable |
| G | Generating mechanism | A causal process that explains a pattern (rGE, AM, niche-picking, critical periods) |
| S | Synthesis claim | An integrative statement combining multiple lower-level claims |
| O | Open question | Genuinely undecided with current methods or evidence |
| D | Distortion vector | Where motivated reasoning concentrates (typed by direction) |
Edge types
| Code | Edge | Meaning |
|---|---|---|
| dep | depends-on | If target collapses, source collapses |
| imp | implies | Logical implication |
| sup | empirically-supports | Evidence relation |
| conf | confounds / inflates | Artifact relationship (e.g., AM inflates rg) |
| mod | moderates | Changes magnitude (e.g., SES × heritability) |
| dev | develops-into | Temporal/developmental successor (temperament → personality) |
| corr | corrects | Within-family corrects between-family bias |
Weight scale (load-bearing weight, 1–5)
- 5 — crux node; collapse propagates across multiple sections of the lit review
- 4 — load-bearing within a section
- 3 — important but local
- 2 — corroborating
- 1 — decorative; could be removed without changing the picture
1. Node catalog
Each node carries: type code · weight · short claim · key citation · status. Status flags: ✓ (robust/replicated), ~ (partial/qualified), ? (contested), ✗ (refuted, kept as historical reference).
A — Foundational assumptions
| ID | Wt | Claim | Status |
|---|---|---|---|
| A1 | 5 | Twin/adoption methods provide approximately valid variance decomposition (EEA modestly violated but not fatally) | ✓ |
| A2 | 5 | GWAS signal reflects real genetic effects, not (only) population stratification or AM artifact | ✓ partial |
| A3 | 5 | A general factor of cognitive ability g is a real dimension of individual variation (positive manifold) | ✓ statistical / ? mechanism |
| A4 | 3 | Heritability findings apply to the population sampled, not to individuals or other populations (scope) | ✓ |
| A5 | 3 | Phenotypes are reliably and validly measurable across cultures and time | ~ |
| A6 | 3 | Most psychological variation is dimensional, not taxonic | ✓ |
M — Methodological prerequisites
| ID | Wt | Tool | Notes |
|---|---|---|---|
| M1 | 4 | Twin studies (MZ/DZ) at scale | Polderman 2015 meta: 14.5M pairs |
| M2 | 4 | Adoption studies, especially cross-cultural (Korean-American) | Sacerdote 2007; Beauchamp 2023 |
| M3 | 5 | GWAS at N ≥ 100k (ideally ≥ 1M for personality/EA) | Okbay 2022 (3M for EA) |
| M4 | 5 | Within-family designs (sibling FE, MZ-discordant, parent-offspring trios). Kong 2018; Okbay 2022 (EA, N=3M); Howe et al. 2022 Nature Genetics extended this to 178k siblings × 25 phenotypes — within-sibship estimates were systematically smaller than population estimates for height, EA, cognitive ability, depressive symptoms, smoking. The within-family approach is now mature beyond just educational attainment | ✓ |
| M5 | 4 | Polygenic scores (PGS) | Best R² ~0.16 for EA, ~0.10 for SCZ |
| M6 | 3 | Cross-trait LD-score regression for genetic correlations | Brainstorm 2018 |
| M7 | 3 | Mendelian randomization | For causal inference from observational data |
| M8 | 3 | Pre-registration & collaborative meta-analysis | Demolished candidate-GxE |
| M9 | 4 | Whole-genome sequencing (rare-variant capture). 2025 follow-up (UK Biobank ~500k, Wainschtein et al. 2025 Nature) captures ~88% of pedigree-based narrow-sense heritability across many traits (20% rare + 68% common variants). The “missing heritability” problem is now substantially resolved for many phenotypes | ✓ |
E — Empirical claims
Cognition / IQ:
| ID | Wt | Claim | Status |
|---|---|---|---|
| E1 | 5 | Mean trait heritability ≈ 0.49 across 17,804 traits (Polderman 2015) | ✓ |
| E2 | 5 | Shared environment C ≈ 0 for adult personality and most adult cognition | ✓ with exceptions (EA, religiosity, politics) |
| E3 | 4 | Wilson Effect: IQ heritability rises from ~0.20 (age 5) to ~0.80 (adulthood) | ✓ |
| E4 | 5 | Hyper-polygenic architecture: thousands of small-effect variants | ✓ (Turkheimer’s 4th Law, Chabris 2015) |
| E5 | 4 | Candidate-gene approach for psychiatric/personality traits failed (5-HTTLPR etc.) | ✗ original claims; ✓ collapse finding |
| E6 | 5 | Within-family PGS effects are ~½ population-level effects (genetic nurture) | ✓ for EA, BMI, height |
| E7 | 5 | Cross-trait assortative mating explains R²≈74% of variance in genetic correlation estimates | ✓ (Border 2022, Science) |
| E8 | 4 | Lead exposure 1–10 µg/dL → −6.2 IQ pts (Lanphear 2005) | ✓ |
| E9 | 4 | Each year of schooling adds ~3.4 IQ points, persisting into old age | ✓ (Ritchie & Tucker-Drob 2018) |
| E22 | 4 | Within-population heritability does not license between-population inference | ✓ logical |
| E23 | 4 | PGS prediction accuracy decays continuously along the genetic-distance continuum from training population (Pearson r = −0.95 between genetic distance and PGS accuracy across 84 traits, Ding et al. 2023, Nature). Reframes the older “discrete ancestry-group drop” picture (Martin 2019; Mostafavi 2020) | ✓ |
| E24 | 3 | Flynn Effect and its post-1990s reversal are both environmentally driven (within-sibship evidence) | ✓ pattern; ? cause |
| E25 | 2 | Scarr-Rowe: SES × heritability hypothesis (more genetic expression in higher-SES). Weakening further in 2024 — Ghirardi et al. 2024 found 39/42 PGI×SES interactions in education NEGATIVE (compensatory direction); only 1 significant positive. Pattern is now closer to “compensatory hypothesis holds, Scarr-Rowe fails” than to “context-dependent” | ✗ |
| E18 | 4 | Positive manifold: every cognitive test correlates positively with every other | ✓ |
| E26 | 3 | Childhood IQ → all-cause mortality: each 1-SD ≈ 24% lower mortality | ✓ (Calvin 2011) |
| E27 | 3 | Lifespan IQ stability: Lothian Birth Cohort age-11 → age-90 r ≈ 0.67 | ✓ |
| E28 | 3 | Severe iodine/alcohol/deprivation cause large asymmetric IQ effects | ✓ |
Personality / temperament:
| ID | Wt | Claim | Status |
|---|---|---|---|
| E29 | 4 | Big Five h² ≈ 0.40–0.60; cross-cultural replication for E/A/C | ✓ partial (Tsimane qualifier) |
| E30 | 4 | Cumulative continuity: rank-order stability rises to ~0.74 by midlife | ✓ |
| E31 | 4 | Maturity principle: mean-level ↑ in C, A, ES with age | ✓ |
| E32 | 3 | Temperament dimensions (Surgency, Negative Affectivity, Effortful Control) → adult personality | ✓ |
| E33 | 3 | Personality predicts mortality/divorce/income at magnitudes ≈ SES & cognition | ✓ (Roberts 2007) |
Sex differences:
| ID | Wt | Claim | Status |
|---|---|---|---|
| E10 | 4 | Multivariate Big Five sex difference: D ≈ 2.71 (~10% overlap) | ~ method-sensitive |
| E11 | 4 | People-things interest difference d ≈ 0.93 (largest in psychology) | ✓ |
| E12 | 3 | Mental rotation d ≈ 0.56–0.73 male advantage | ✓ |
| E13 | 3 | Math performance d ≈ 0.05–0.10 (essentially equal) | ✓ |
| E14 | 4 | Gender Equality Paradox: differences larger in egalitarian/wealthier countries. Herlitz et al. 2025 systematic review (54 articles, 27 meta-analyses) confirmed the pattern across personality, verbal abilities, episodic memory, and negative emotions — pattern replication has strengthened, not weakened | ✓ pattern; ? mechanism |
| E34 | 3 | Physical aggression d ≈ 0.40–0.60 male; ~95% of homicides male | ✓ |
| E35 | 2 | CAH girls show masculinized toy preferences; primate parallels | ✓ |
Psychopathology:
| ID | Wt | Claim | Status |
|---|---|---|---|
| E15 | 4 | All major psychiatric disorders highly heritable (h² 0.35–0.85) and hyper-polygenic | ✓ |
| E16 | 4 | Cross-disorder genetic correlations exist | ✓ existence; ? magnitude post-AM |
| E17 | 3 | A p factor (general psychopathology) fits cross-syndrome data | ✓ statistical; ? interpretation |
| E36 | 3 | Autism: common-PGS positively correlated with IQ; rare/de-novo drives ID-comorbid cases | ✓ |
| E37 | 2 | Critical-period plasticity (GABAergic, perineuronal nets) is mechanistically real | ✓ |
L — Logical necessities
| ID | Wt | Claim |
|---|---|---|
| L1 | 5 | Heritability is a population variance ratio; it does not partition individual phenotypes (mathematical form — A4 is the scope-of-claim sibling) |
| L2 | 4 | h² changes with environmental variance: hold genes constant, equalize environments → h² → 1 |
| L3 | 5 | Within-family designs control for between-family confounds (rGE, stratification, AM) |
| L4 | 5 | Within-population heritability provides no information about between-population mean differences (Lewontin). E22 in the empirical column is the applied form of this same point |
| L5 | 3 | Multivariate D ≥ max(univariate d) when component dimensions are positively correlated |
| L6 | 3 | Positive manifold permits both unitary-cause and emergent-network interpretations of g |
| L7 | 3 | Effect-size interpretation is scale-dependent (d=0.10 trivial in trait psychology, large in clinical) |
G — Generating mechanisms
| ID | Wt | Mechanism | Drives |
|---|---|---|---|
| G1 | 4 | Active rGE / niche-picking | E3 (Wilson Effect amplification) |
| G2 | 5 | Passive rGE. Wang 2021 / Isungset 2022 confirm indirect ≈ ½ direct genetic effect for EA. Nivard et al. 2024 found indirect genetic effects on offspring achievement extend beyond the nuclear family — dynastic / extended-family / community processes contribute, so the “parents transmit gene + correlated environment” framing understates the spread | E6 (genetic nurture); inflation of population-level h² |
| G3 | 3 | Evocative rGE | Heritability of “environments” (Kendler & Baker 2007) |
| G4 | 3 | Critical-period plasticity (GABAergic maturation) | Asymmetric environmental effects on early development |
| G5 | 4 | Assortative mating → LD induction | Inflates additive genetic variance, h² |
| G6 | 5 | Cross-trait AM → spurious genetic correlations | Confounds E16, E17 (p-factor) |
| G7 | 4 | Stochastic developmental noise | Dominant source of non-shared environment |
| G8 | 3 | Selection / niche construction across the lifespan | Bridges temperament → personality → outcome cascade |
S — Synthesis claims
| ID | Wt | Claim |
|---|---|---|
| S1 | 5 | ”Genes vs. environment” is the wrong frame; the system is tightly coupled (genome × rGE × AM × few large environmental insults × stochastic noise × culture × developmental unfolding) |
| S2 | 5 | Twin h² ≥ SNP h² ≥ within-family h² gradient quantifies AM/rGE/measurement inflation across estimation methods |
| S3 | 4 | Heritability ≠ destiny; high h² is compatible with large environmental shifts (height: h² ≈ 0.80, +10cm in a century) |
| S4 | 4 | Most “non-shared environment” is stochastic, not systematic — it accounts for ~50% of personality variance and is poorly characterized |
| S5 | 4 | Two parallel hierarchies (CHC for cognition, HiTOP for psychopathology) connected at the top by inverse g↔p genetic correlation |
| S6 | 4 | Developmental cascade: temperament (infant biological reactivity) → personality (adult social-cognitive layer added) → outcomes (mortality, attainment, relationships) with h² ↑ and shared-env ↓ across the lifespan |
O — Open questions
| ID | Wt | Question | Why it matters |
|---|---|---|---|
| O1 | 5 | Mechanistic interpretation of PGS: “causal genetic” (Plomin) vs. “weak explanation” (Turkheimer) | Determines what PGS prediction means |
| O2 | 3 | Cause of Flynn-effect reversal post-1990s | Empirical pattern robust (Bratsberg & Rogeberg 2018 within-sibship Norway). Mechanism still unsettled. Pietschnig et al. 2024 (Vienna 2005–2018 cohort) added a wrinkle: the positive manifold itself may be weakening — gains in some abilities aren’t tracking gains in others, suggesting the g-loading of the rise/fall is not constant. Hypothesized mechanisms (screens, reduced long-form reading, attention) circulate without empirical pinning |
| O3 | 4 | Causal mechanism behind Gender Equality Paradox | Innate-expression release vs. measurement artifact vs. confound — selection of explanation has political stakes |
| O4 | 5 | Between-population mean differences: any genetic component? | Currently scientifically unanswerable with available methods (PGS portability too poor, cross-ancestry GWAS at scale don’t exist). Honest position: unresolved, not settled in either direction |
| O5 | 3 | g architecture: latent common cause vs. emergent network (mutualism, van der Maas 2006) | Affects how interventions could in principle move g |
| O6 | 4 | What “non-shared environment” actually is: stochastic noise, immune/microbial, peer networks, epigenetic, measurement error | Largest unmodeled variance component in personality |
| O7 | 5 | Magnitude of AM-correction across the cross-disorder genetic correlation matrix | Active revision. Ma, Wang, Border et al. 2024 AJHG introduced LAVA-Knock — a local-genetic-correlation method that reduces xAM-induced bias. Methods to give the answer are now emerging, not just to flag the problem |
D — Distortion vectors (where motivated reasoning concentrates)
| ID | Direction | Targets | Failure mode |
|---|---|---|---|
| D1 | Blank-slate / environmentalist | A1, E1, E10–E14 | Dismiss twin studies wholesale; oversell transgenerational epigenetics; overstate stereotype threat; minimize sex differences via univariate-only framing |
| D2 | Hereditarian | L4, E22, E23, O4 | Ignore Lewontin; treat g-loadedness of gaps as evidence of genetic etiology; cite fringe admixture studies; ignore AM/rGE corrections to PGS |
| D3 | ”Gender similarities” minimization | E10, E11, E14 | Selective citation (math d=0.05) to imply no differences anywhere; obscure D=2.71 multivariate; minimize d=0.93 interest gap |
| D4 | Pop evpsych overgeneralization | E10–E14, A6 | Treat dimensional ds as taxonic; extrapolate small ds to categorical claims; overgeneralize from specific tasks to broad-domain claims |
2. Dependency cascade
The cascade reads from foundations up to synthesis, and from corrections back down to corrected claims.
Forward cascade (foundations → empirical claims → synthesis)
A1 ──dep──> M1, M2 ──sup──> E1, E2, E3, E25, E29
A2 ──dep──> M3, M5 ──sup──> E4, E6, E7, E15–E17, E22–E23
A3 ──dep──> E18 ──sup──> E26, E27 ──imp──> S5
A4 (scope) + L1 (form) ──guards──> interpretation of E1, E2, E3 and S3
A6 ──imp──> S5 (dimensional turn in psychiatry)
M9 ──corr──> E1 (closes missing-heritability gap)
M3 + M4 ──sup──> E6 (genetic nurture), E7 (xAM)
E5 (candidate-gene collapse) ──imp──> E4 (polygenic architecture confirmed by absence of large hits)
E1 + E2 + E3 + G1 ──imp──> S6 (developmental cascade)
E1 + E4 + G2 + G5 ──imp──> S2 (h² gradient by method)
E10 + E11 + E12 + E13 + L5 ──imp──> "small univariate, large multivariate" sex-difference picture
E14 + O3 ──imp──> mechanism-pending GEP
E15 + E16 + E17 ──imp──> S5 (HiTOP/p)
E22 + E23 + L4 ──imp──> O4 (between-pop unanswerable currently)
E1 + E2 + E4 + E6 + E7 + E8 + E9 + G1–G7 ──imp──> S1, S2 (integrated picture)
Backward / corrective cascade (newer evidence revises older claims)
G2 (passive rGE) ──corr──> E1 estimates (population-level overstates direct genetic)
G6 (cross-trait AM) ──corr──> E16 (some psychiatric rg's may be xAM artifact)
M4 (within-family) ──corr──> E6 magnitude (~½ of population PGS)
M8 (preregistration) ──corr──> E5 (collapsed candidate-GxE)
M9 (WGS) ──corr──> "missing heritability" interpretation
Distortion → target edges
D1 ──attacks──> A1, E1, E10, E11, E12, E14
D2 ──attacks──> L4, E23 (ignores), exploits A2 absent corrections from G2/G6
D3 ──attacks──> E10, E11 (selective univariate framing)
D4 ──attacks──> A6, L7
3. Where pressure concentrates
A common failure mode in this literature is to treat all high-stakes nodes as the same type of thing. They are not. The graph has three distinct categories of high-stakes node and one category of fragile claim — keeping these separate sharpens what the field actually needs to resolve.
3a. Foundational cruxes — falsification breaks regions of the picture
These are the empirical-or-methodological assumptions that, if wrong, force rebuilding large parts of the lit review.
A1 — Twin/adoption method validity. Carries Section 1 of the lit review; heritability-by-domain table; Wilson Effect. Robustness: HIGH (MZ-reared-apart, SNP-h² bypassing EEA, misperceived-zygosity all converge). Would flip if SNP-h² for psychological traits systematically converged on <0.05 — has not occurred.
A2 — GWAS signal is real (not artifact). Carries the PGS enterprise; genetic nurture estimates; cross-disorder pleiotropy; modern psychiatric genetics. Robustness: MODERATE-HIGH (within-family PGS effects are non-zero for EA, BMI, height — direct signal exists; AM/stratification inflation magnitudes still being quantified). Would flip if within-family PGS effects converged on zero across most traits.
A3 — g is a real dimension of cognitive variation. Carries Section 5 of lit review; predictive-validity claims; CHC structure; mortality/income predictions. Robustness: HIGH for g as a statistical regularity; MODERATE for g as unitary biological mechanism. Would flip if a broad cognitive battery had first-PC <15% or if interventions reliably moved one ability while lowering others. 2024 wrinkle: Pietschnig et al. 2024 reported the positive manifold may be weakening across recent cohorts — softly pressures A3 in a new way without refuting it.
3b. Reframer nodes — the answer is open and reshapes interpretation
These don’t break the picture if reversed; they change what the picture means. Their magnitudes are being actively quantified in 2024–2026 work. Conflating reframers with foundational cruxes is the most common conceptual error in pop-science treatments of this field.
G2 / E6 — Passive rGE / genetic nurture. Reframes the meaning of every population-level genetic estimate. Without G2, “genetic transmission” reads as direct biological causation; with G2, ~half is environmentally mediated by genetically-similar parents (Wang 2021 / Isungset 2022). Nivard et al. 2024 (Nat Hum Behav) showed indirect genetic effects extend beyond the nuclear family to dynastic / extended-family processes. The existence is robust; precise magnitude across all traits is still being quantified.
G6 / E7 — Cross-trait assortative mating. Reframes the cross-disorder genetic-correlation matrix and the p-factor’s interpretation. Border 2022 (Science) showed phenotypic cross-mate correlations explain R²=74% of variance in genetic-correlation estimates. Ma, Wang, Border et al. 2024 (LAVA-Knock) is the first method to systematically reduce xAM-induced bias. The share of any specific rg that is artifact vs. genuine pleiotropy is still pending.
3c. Logical guardrails — unfalsifiable but load-bearing for interpretation
These cannot be falsified — they are algebraic / definitional truths. They can be ignored, which is how most public-discourse misuse of the field happens.
L1 — Heritability is a population variance ratio, not an individual partition. Cannot be falsified. Public misreading of “70% heritable IQ” as “70% of any individual’s IQ comes from genes” is the failure of L1, not the science.
L4 — Within-population heritability does not license between-population mean inference (Lewontin firewall). Cannot be falsified — it is a logical/algebraic point. Can only be ignored. The empirical buttress today is E23 (PGS portability collapse along genetic-distance continuum, Ding 2023): even if you wanted to use within-pop methods to speak to between-pop differences, the methods don’t currently work.
3d. Decorative material (safe to compress)
Removable from the topology without changing the qualitative picture:
- E35 (CAH / primate toy preferences) — convergent evidence, not necessary
- E37 (specific GABAergic critical-period mechanisms) — biologically real, not load-bearing for the variation argument
- HEXACO Honesty-Humility specifics — incremental over Big Five
- Specific Dark-Triad subdimensions — D-factor synthesis (Moshagen 2018) carries more weight
- P-FIT brain network specifics — corroborate g but don’t establish it
- Yehuda Holocaust FKBP5 transgenerational findings — refuted/non-replicated; kept only as historical anchor for D1 distortion
- Specific candidate-gene findings (5-HTTLPR depression) — refuted; kept as historical anchor for the field’s methodological turn (M8)
4. Weakest links
These are the load-bearing pieces with the lowest current confidence. Targeted attack on any one would do the most damage to the integrated picture.
W1: Generalization from candidate-GxE failure to “all GxE is small” (E5 → broader claim)
Why fragile: The candidate-gene collapse is definitive. The extrapolation that polygenic-score × environment interactions are also small is an inductive leap, not a result. As of 2025, the picture is partially holding the null but not strengthening it. A 2025 systematic review of 56 PGS×E studies for depression found mostly null or small effects. A multivariable PGS×E study of educational achievement (Allegrini et al. 2020) found “no evidence that GxE effects significantly contributed to multivariable prediction.” UK Biobank work (2024) on distinct explanations of GxE shows that many apparent GxE signals are confounded by scale, ascertainment, or population structure. The candidate-gene-failure extrapolation is looking less like an inductive leap and more like a substantive empirical pattern — but the literature is still too young for a strong null.
Pressure test: Several large preregistered PGS×E studies finding interactions explaining >5% variance would substantially revise this corner of the picture.
W2: Scarr-Rowe (E25) — has substantially weakened since pass-0
Why fragile: The original meta-analytic picture was “replicates in US, fails in W. Europe / Australia” (Tucker-Drob & Bates 2016). Ghirardi et al. 2024 (Netherlands Twin Register, polygenic-index design across 42 PGI×SES interaction tests for educational outcomes) found 39/42 negative, 0 significant positive, 1 marginally significant positive — i.e., the opposite sign from Scarr-Rowe in most cases. The picture in 2026 is closer to “the compensatory hypothesis (more genetic expression in low-SES because constrained environments suppress non-genetic variance) is the better-supported pattern, at least for educational outcomes.” E25’s weight has been downgraded from 3 → 2 to reflect this. The narrative “deprivation suppresses heritability” — popular in policy discourse — is now evidence-thin.
W3: Plomin-vs-Turkheimer interpretation of PGS (O1)
Why fragile: Both views are compatible with current data. Determines what PGS means — direct biology vs. summary statistic of correlated environments. The field publishes ambiguously across both interpretations. Will likely be settled only by within-family-only PGS that are still well-powered.
W4: Magnitude of AM-correction across psychiatric cross-disorder rg matrix (O7) — methods now emerging
Why still fragile but improving: Border 2022 showed xAM explains R²=74% of variance in genetic-correlation estimates but didn’t prove all rg’s are spurious — some genuine pleiotropy surely exists. As of 2024, the field is moving from flagging the problem to building correction methods. Ma, Wang, Border et al. 2024 (American Journal of Human Genetics) introduced LAVA-Knock, a local-genetic-correlation method using knockoff inference to reduce xAM-induced bias; tested across 630 trait pairs in simulation and real GWAS, it substantially reduces but does not eliminate the bias. A 2024 study found AM genetic signatures across SCZ, BD, MDD, alcohol phenotypes, and Tourette syndrome — confirming xAM is not selective. What’s still pending: how much of the cross-disorder rg matrix and the p-factor genetic signal survives systematic application of AM-correction methods at scale. Likely answer in 2–3 years.
W5: Gender Equality Paradox mechanism (O3, E14) — pattern strengthened, mechanism still contested
Empirical pattern: more robust as of 2025. Herlitz et al. 2025 systematic review (54 articles, 27 meta-analyses, Perspectives on Psychological Science) found the paradox replicates across personality, verbal abilities, episodic memory, and negative emotions. Balducci et al. 2024 extended it to within-individual academic strengths cross-temporally. The “this won’t replicate” objection has weakened.
Mechanism: still contested. Three live candidates: (a) innate-expression release in resource-rich environments, (b) reference-group / self-anchoring artifacts in self-report (people compare to their gender peers, not to humans-in-general), (c) wealth/freedom confounds with gender equality. Behavioral / incentivized-measure replications (Falk & Hermle 2018 for economic preferences) cover only part of the domain. The decisive test — non-self-report behavioral replication across personality and interests — is still incomplete. Each candidate mechanism implies different normative conclusions, which is part of why this remains contested rather than resolved.
W6: Flynn-reversal cause (O2) — and a new wrinkle on the positive manifold
Why fragile: The pattern is environmentally driven (within-sibship, Bratsberg & Rogeberg 2018), so “dysgenic” explanations are out. But no mechanism (screen time, education quality, attention, nutrition, lead, microplastics) has been pinned down with within-cohort empirical work. Pietschnig et al. 2024 (Vienna 2005–2018 cohort) added a structural twist: the positive manifold itself may be weakening across cohorts — meaning the recent rise/fall is not uniformly g-loaded. If confirmed broadly, this softly pressures A3 (g exists as a stable dimension) — not refuting it, but suggesting its strength may be cohort-dependent. Still not load-bearing for the integrated picture, but interacts with A3 in a new way.
W7: A6 (dimensional vs. taxonic) at psychiatric extremes
Why fragile: Most psychopathology is dimensional (taxometric evidence is robust), but for severe early-onset autism with intellectual disability, rare large-effect variants (CHD8, SCN2A, SYNGAP1) drive a partly taxonic picture. The “all dimensional” framing oversells continuity at the severe tail.
5. Variant views
The same graph, read four ways.
Variant A: Vulnerability map — where does this break?
The vulnerability map is the union of the three foundational cruxes (§3a), two reframer nodes (§3b), two logical guardrails (§3c), and seven weakest links (§4). Together they describe the smallest set of pressure points whose movement would force restructuring of the integrated picture:
- Falsify A1: SNP-h² systematically <0.05 → twin-method discredited → Section 1 collapses
- Falsify A2: within-family PGS → 0 → modern psychiatric genetics collapses
- Falsify A3: positive manifold dissolves → Section 5 collapses
- Falsify G2: within-family PGS = population PGS → genetic nurture is null → Plomin direct-causal view wins (O1 resolves)
- Falsify G6 fully: AM correction barely changes rg matrix → cross-disorder pleiotropy is real
- Violate L4: cannot be falsified, only ignored — but its violation in public discourse is the largest single source of public confusion
If exactly one of these were to flip, the rebuild would be: A1→ rebuild Section 1 only; A2→ rebuild Sections 1, 3, 7 (~40% of lit review); A3→ rebuild Section 5 (~25%); G2/G6→ keep numbers, rewrite causal interpretation throughout.
Variant B: Flow map — how does causation propagate?
Causation in this system runs in two directions, both important.
Forward developmental flow (genome → outcomes):
Genome (polygenic + few rare large-effect)
│
├──> Temperament (infant biological reactivity: Surgency / NA / EC)
│ │
│ ├──> Active rGE / niche-picking ─────────┐
│ │ │
│ └──> Evocative rGE (eliciting responses) ┤
│ │
└──> Direct expression in brain development ─────┤
▼
Personality (adult)
│
├──> Attainment
├──> Relationships
├──> Health behaviors
└──> Mortality
Indirect / dynastic flow (parents’ genome → offspring environment → offspring outcome):
Parents' genome
│
├──> Parents' phenotype (income, vocabulary, parenting style, neighborhood choice)
│ │
│ └──> Offspring's rearing environment ────────┐
│ ▼
│ Offspring outcomes
│ ▲
└──> Transmitted alleles ───────────────────────────┘
The genetic-nurture finding (E6) says these two pathways have roughly equal magnitude for educational attainment. They are partially separable only via within-family designs (M4) or non-transmitted-allele PGS.
Cross-generational drift via assortative mating:
Mating choice (correlated on phenotype)
│
└──> LD induction among causal variants (G5)
│
├──> Inflated additive genetic variance
├──> Inflated h²
├──> Inflated cross-trait genetic correlations (G6)
└──> Inflated PGS prediction accuracy
Variant C: Minimal claim set — smallest set supporting the conclusion
The smallest collection of claims that yields the integrated picture (S1) is eight nodes:
- E1 — Mean trait h² ≈ 0.49 (heritability is real and substantial)
- E4 — Polygenic architecture (no master genes)
- E6 — Within-family PGS ≈ ½ population PGS (genetic nurture is real)
- E7 — Cross-trait AM is a major source of inflated genetic correlations
- E8 — A small set of large environmental insults have causal effects (lead, iodine, alcohol, deprivation, schooling)
- L4 — Within-pop ≠ between-pop (Lewontin)
- G7 — Stochastic developmental noise is the dominant source of non-shared environment
- A6 — Most psychological variation is dimensional, not taxonic
These eight together generate the qualitative integrated picture without requiring detailed effect-size tables, cross-cultural caveats, or specific candidate-gene history. The remaining ~50 nodes refine and corroborate but do not change the shape.
Variant D: Politicization map — where does motivated reasoning concentrate?
This is the variant most relevant to the topic framing (“a minefield of motivated reasoning on all sides”).
Distortion-to-target matrix:
| Distortion | Targets | Move | Counter-evidence |
|---|---|---|---|
| D1 Blank-slate | A1, E1, E10–E14 | ”Twin studies are flawed; differences are socialization” | SNP-h² (bypasses EEA), MZ-reared-apart, Su 2009 (d=0.93), CAH/primate convergence |
| D2 Hereditarian | L4, E22, E23, O4 | ”Group differences are genetic” | PGS portability collapse (E23); Lewontin (L4); cross-ancestry GWAS at scale don’t exist |
| D3 Gender-similarities | E10, E11, E14 | ”All differences are tiny (cite math d=0.05)“ | Multivariate D=2.71 (E10); people-things d=0.93 (E11); GEP (E14) |
| D4 Pop-evpsych | A6, L7, E10–E14 | ”Men are X, women are Y” (categorical from dimensional) | A6 (dimensional); L7 (effect-size context) |
Why all four distortions can target the same evidence base: the evidence base contains both large differences (people-things d=0.93) and trivial ones (math d=0.05) and strong heritability (h²=0.49) and large environmental insults (lead, schooling) and logical guardrails against between-group inference (L4). Any single-direction narrative requires selective citation. The integrated picture (S1) requires holding all of it at once.
Operational implication for the formalization stage: any model that only parameterizes the variance components without parameterizing the interpretation of those components will be silently captured by whichever distortion the reader is most prone to. The formal model needs to make L4, G2, G6, and the dimensional/taxonic distinction (A6) structurally visible, not just numerically present.
6. Topology → formalization handoff
What the next stage (model formalization) should pick up.
Ready for equations
-
Variance decomposition — fully specifiable now:
V(P) = V(A_direct) + V(A_indirect) + V(A_AM-LD) + V(C_residual) + V(E_measured) + V(E_stochastic) + 2·Cov(G,E) + V(GxE)
With each V parameterized by trait, age, population (US vs. Europe for E25), and method (twin / SNP / within-family). Cov(G,E) captures rGE; V(A_AM-LD) captures G5/G6; V(A_indirect) captures G2.
-
Method gradient (S2): twin h² ≥ SNP h² ≥ within-family h², with the gaps decomposable into AM, rGE, and rare-variant contributions. Parameterize as a function of estimation method.
-
Wilson-effect curve: h²(age) = a + b·log(age) or similar saturation form, with the slope driven by G1 (active rGE). Calibratable from Bouchard 2013 and Briley & Tucker-Drob 2013.
-
Multivariate sex-difference algebra: D² = (μ₁ - μ₂)ᵀ Σ⁻¹ (μ₁ - μ₂), with a worked example showing how D = 2.71 follows from moderate univariate ds and a positive-correlation covariance structure.
-
PGS-portability decay function: prediction accuracy as a continuous function of genetic distance from training population (Ding et al. 2023, Nature: r = −0.95 between genetic distance and accuracy across 84 traits).
Still at observation stage (formalization premature)
- O1 — Plomin/Turkheimer interpretation of PGS: not yet a formal disagreement, just a verbal one
- O3 — GEP mechanism: the algebra of “innate expression release” is not yet specified
- O6 — what non-shared environment is: no candidate decomposition
- O7 — share of cross-disorder rg that survives AM correction: empirical question pending — but methods are now emerging (LAVA-Knock); answer likely in 2–3 years, at which point this moves to “ready for equations”
Connection to adjacent topics in the LLM-iterate pipeline
This topology is the natural input to Parent-to-Child Transmission (planned topic). The genetic-nurture finding (G2/E6) and the dynastic-extension finding (Nivard 2024) are the empirical answers to “how much does parenting matter beyond genes” that the parent-child topic will need to build on. When that topic spins up, the variance decomposition equation here should be its starting point.
Less directly: the Evolution-Modernity Mismatch topic will lean on the GEP (E14/O3) and Flynn-reversal (O2) findings as evidence of environment-driven shifts in expressed psychological variation. Bedrock Generating Functions can read the variance decomposition itself as one such bedrock function.
7. Next moves — three options for Stage 3
The user picks one of these as the primary formalization target. Each leaves the others viable as later modules but shapes Stage 4 (data) differently.
Option A — Variance decomposition + method gradient (most central)
Build the central equation V(P) = V(A_direct) + V(A_indirect) + V(A_AM-LD) + V(C) + V(E_meas) + V(E_stoch) + 2·Cov(G,E) + V(GxE) parameterized by trait, age, population, and estimation method. Build a tool that takes a published h² estimate (twin, SNP, or within-family) and outputs a method-corrected estimate with explicit AM/rGE adjustment.
Pros: most central to the topic; directly answers “what generates psychological variation”; feeds Stage 4 cleanly (every term has published estimates somewhere). Cons: many parameters; risk of producing a calculator nobody uses without strong UI judgment. Stage 4 implication: pull h² estimates from PGC, SSGAC, GIANT consortia; calibrate the method-gradient term per trait class.
Option B — Multivariate sex-difference algebra (most pedagogically clean)
Formalize how moderate univariate Cohen’s ds combine into a large multivariate Mahalanobis D, with a worked Big-Five example showing how D ≈ 2.71 emerges from |d| ≈ 0.4 ds and a positive-correlation Σ. Build a dashboard letting the user dial univariate ds and the correlation matrix to see D move.
Pros: tightly scoped; resolves the single biggest framing trap in the GEP debate (univariate vs. multivariate framings of the same data); high pedagogical leverage. Cons: narrower than A; doesn’t engage the heritability core. Stage 4 implication: pull effect-size matrices from Del Giudice 2012 and Schmitt 2008 cross-cultural data; replicate D under different correlation structures.
Option C — PGS-portability calibration (most practically useful)
Turn Ding et al. 2023’s continuous decay finding into a usable accuracy estimator: enter an individual’s genetic distance from the PGS training population and get an accuracy-decay multiplier. Apply across the major trait PGSs (EA, SCZ, BMI, etc.).
Pros: directly addresses a real-world bias; smallest scope; ships fastest; useful even outside this project’s domain. Cons: less central to the heritability question; might fit better as a tool than a topic-stage. Stage 4 implication: pull cross-ancestry GWAS validation data from the All of Us / GenomeAsia / H3Africa consortia.
My recommendation: A as primary (most central to the topic’s stated purpose), with B as a stretch module if scope allows. C is high-value but might better live as a standalone tool promoted to /models later.
8. Objections to this topology (adversarial + steelman)
Four ways a careful reader could push back. The strongest version of each, then my response.
Objection 1 — Discrete typed edges falsify a continuous, magnitude-weighted, context-conditional system
Heritability is not “supported by” a twin study in the same binary way that a logical implication holds. The system is a tightly coupled developmental process; flattening it into nodes-and-arrows with discrete edge types loses information about magnitude, conditional dependence, and gradient relationships.
Response: Acknowledged, and intentional. The topology is the qualitative skeleton; edge weights and conditional dependencies are the job of Stage 3 (formalization), where each edge will be turned into a parameterized function. The graph’s value is not that it stands in for the full system but that it makes the structure visible cheaply enough that the formalization knows where to put the parameters.
Objection 2 — The crux/decorative split is editorial, not empirical
There is no algorithm that picks crux nodes; the choice depends on which failure modes you are worried about. A 1990s topology of this field would have crowned candidate-gene findings as cruxes. Naming A2 (GWAS signal real) a crux today is a judgment call about the field’s current methodological commitments — not an objective feature of the science.
Response: Correct. Cruxes are time-stamped. This topology is a 2026 snapshot. If the field shifts (post-AM-correction era, post-within-family-PGS-at-scale era) the crux set will shift — that is what the refinement passes are for. Use this as a current map, not an immutable structural claim.
Objection 3 — Calling L4 a “logical firewall” overstates the case
Lewontin’s 1970 argument has been challenged. Edwards (2003) “Lewontin’s Fallacy” showed that Lewontin’s specific quantitative point — that ~85% of human genetic variance is within rather than between populations — does not preclude reliable population-classification from genetic markers. Modern population genetics treats between-population genetic inference as more nuanced than the firewall framing suggests.
Response: The Edwards critique is real, but it addresses a different claim. Edwards refuted “you cannot reliably classify individuals into populations from genetic data.” The L4 firewall as I formulate it says “within-population heritability provides no information about between-population mean differences without strong auxiliary assumptions about shared causal architecture and equal environments.” Those are different propositions. PGS portability collapse (E23) is the contemporary empirical evidence that the auxiliary assumptions are not currently being met for psychological traits. The firewall framing survives the Edwards critique; its strength rests on the empirical PGS-portability finding, not on the original Lewontin variance argument alone.
Objection 4 — The Politicization variant is meta-commentary, not topology
The D nodes and attacks edges describe how people misuse the evidence base. That is epistemics or sociology of science, not structural topology of the field. A pure topology should omit them.
Response: Fair, and the inclusion is non-orthodox. It is justified here only by the topic framing — the user’s prompt explicitly described the field as “a minefield of motivated reasoning … where the actual generating functions are obscured by politics.” A topology of just the science would omit the D nodes; a topology that helps a reader navigate the field as it is actually encountered should include them. The D nodes will not be carried into Stage 3 formalization — they exist for navigation, not for downstream computation.
9. Glossary
For readers approaching this from outside the field. Terms appear throughout the lit review and topology; this is the lookup table.
| Term | Meaning |
|---|---|
| h² | Heritability — fraction of trait variance in a population attributable to genetic variation. A population statistic, not an individual one. |
| SNP | Single-nucleotide polymorphism — a single-base difference at a position in the genome where multiple variants exist in the population. |
| GWAS | Genome-wide association study — scans hundreds of thousands of SNPs against a measured trait, looking for statistical association. |
| PGS | Polygenic score — a per-individual sum of trait-associated SNPs weighted by their GWAS effect sizes. Used as a predictor. |
| LD | Linkage disequilibrium — non-random association between alleles at nearby loci, typically because they are inherited together. |
| AM | Assortative mating — partners resemble each other on a trait above chance. xAM = cross-trait AM (e.g., taller-than-average partners with more-educated-than-average). |
| rGE | Gene-environment correlation. Passive (parents transmit genes + correlated environment), evocative (heritable traits elicit responses), active (people select environments matching propensities). |
| GxE | Gene-environment interaction — the same genotype produces different phenotypes in different environments. |
| EEA | Equal environments assumption — the twin-method assumption that MZ and DZ twins are treated similarly enough that any extra MZ phenotypic resemblance reflects genetics, not differential treatment. |
| MZ / DZ | Monozygotic (identical, ~100% shared DNA) / dizygotic (fraternal, ~50% shared DNA) twins. |
| rg | Genetic correlation between two traits — how much the same genetic variants influence both. |
| WGS | Whole-genome sequencing — capturing every base in the genome, including rare variants GWAS misses. |
| g-factor | General factor of cognitive ability — the latent dimension behind the positive manifold (every cognitive test correlates positively with every other). |
| p-factor | Proposed general factor of psychopathology — analogous to g, derived from cross-syndrome correlations. |
| CHC / HiTOP | Cattell-Horn-Carroll cognitive-ability hierarchy / Hierarchical Taxonomy of Psychopathology (a dimensional alternative to DSM). |
| d (Cohen’s d) | Standardized mean difference between two groups, in standard-deviation units. Effect-size labels (small / medium / large) are scale-dependent — see L7 in the node catalog. |
| Mahalanobis D | Multivariate generalization of Cohen’s d — distance between two group means in the geometry of the trait space, accounting for correlation between traits. |
| Within-family design | Comparing siblings or MZ-discordant twins or parent-offspring trios within the same family — controls for between-family confounds (population stratification, AM, passive rGE). |
| Genetic nurture | Effect of parents’ genotype on offspring outcomes via the environment the parents create — including alleles the parent did not transmit. |
10. Stage_outputs convention reference
Raw working drafts from each LLM-iterate stage live at:
stage_outputs/<topic>/<stage>.md
Where <topic> is kebab-case (e.g., human-psych-variation) and <stage> is one of: lit-review, topology, model, data, build. Polished versions move into src/content/ai_research/<topic>/<stage>.mdx with proper frontmatter (title, description, date, status, refinementPass, refinementLog) once ready to publish on the site.
The interactive D3 graph for this topology lives at src/components/research/PsychVariationGraph.tsx and is mounted in src/content/ai_research/human-psych-variation/topology.mdx via client:load.