Topology in-progress pass 3

Topology

Dependency graph of the lit review. Three categories of high-stakes node (foundational cruxes / reframer nodes / logical guardrails) plus weakest links, four variant views, three Stage-3 options, an objections section, and a glossary. Updated through 2024-2025 literature on AM correction, within-family GWAS at scale, PGS portability, Scarr-Rowe collapse, GEP replication, and missing-heritability closure.

TLDR

The lit review documents what the science says about psychological variation. This topology asks a sharper question: what depends on what? Strip the field down to its load-bearing structure and the picture is surprisingly clean. Three foundational assumptions — that twin/adoption methods give approximately valid variance decomposition (A1), that GWAS signal reflects real genetic effects rather than population structure or assortative-mating artifact (A2), and that a general factor of cognitive ability g exists as a real dimension of individual variation (A3) — sit upstream of most of the empirical and synthesis nodes in the graph; if any one of them flipped, large regions would have to be rebuilt. Everything else is either an empirical claim resting on these foundations, a methodological prerequisite that lets the foundations be tested, a logical necessity that constrains how the empirical claims can be interpreted, or a generating mechanism that explains why the empirical pattern looks the way it does.

The high-stakes nodes split into three categories — keeping them separate is the single most useful conceptual move in this topology. Foundational cruxes (A1 twin validity, A2 GWAS signal real, A3 g exists) are the assumptions that, if falsified, force rebuilding regions of the picture. Reframer nodes (G2/E6 passive rGE / genetic nurture; G6/E7 cross-trait assortative mating) don’t break the picture if reversed — they change what it means; their magnitudes are being actively quantified and their precise share of population-level “genetic” effects is the field’s most consequential open quantity. Logical guardrails (L1 variance-ratio definition; L4 Lewontin firewall) cannot be falsified — they can only be ignored, which is exactly how most public-discourse misuse of the field proceeds. Conflating these three types under a single label of “important findings” is a major source of bad-faith debate.

The field’s weakest links are not where public discourse focuses heat. Mainstream contests over “is heritability real” target settled findings (A1+E1 are robust); the 2025 whole-genome-sequencing work (Wainschtein et al. 2025, Nature) now closes ~88% of the pedigree-based heritability gap, so the “missing heritability” critique is also substantially answered. The actual fragile zones in 2026 are: (a) the generalization from candidate-GxE failure to all-GxE-is-small — partially holding the null (Allegrini 2020 for education, 2025 systematic review for depression) but the literature is still too young for confidence; (b) Scarr-Rowe has weakened further — Ghirardi et al. 2024 found 39/42 PGI×SES interactions in the opposite (compensatory) direction, so “deprivation suppresses heritability” is now evidence-thin; (c) the polygenic-score → mechanism inference (Plomin’s “causal” view vs. Turkheimer’s “weak explanation” view) remains genuinely undecided; (d) the magnitude of AM-correction across psychiatric cross-disorder rg estimates is now being actively addressed — Ma, Wang, Border et al. 2024 (LAVA-Knock) is the first method to systematically reduce xAM-induced bias, with the field-wide answer likely in 2–3 years. The Flynn-reversal cause and the Gender Equality Paradox mechanism remain open mechanistic questions, but they are open in a different way — the empirical patterns themselves are robust; only the explanation is contested. The 2025 GEP systematic review (Herlitz et al.) actually strengthened the pattern across personality, verbal abilities, episodic memory, and negative emotions.

This topology is the input to model formalization (Stage 3). The cleanest formalization target is the variance decomposition equation: V(trait) = V(direct genetic) + V(genetic nurture / indirect) + V(AM-induced LD) + V(shared environment) + V(measured non-shared) + V(stochastic) + 2·Cov(genes, environment) + interaction terms — with each term parameterized by trait, age, and population context, and with the AM and rGE terms being where current methodological revision is concentrated. The four variant views below (Vulnerability / Flow / Minimal / Politicization) read the same graph through different lenses to make the formalization choices easier.

The graph

All ~50 nodes and their dependencies. Click a node for detail; drag to rearrange.

Legend

Assumption

Method

Empirical

Logical

Mechanism

Synthesis

Open

Distortion

Crux node (halo)

Node size reflects load-bearing weight (1–5).

Click a node to see its claim, status, and load-bearing weight. Hover an edge to see the relation type. Drag nodes to rearrange.

Click a node for its claim and load-bearing weight; hover an edge for the relation type; drag to rearrange. The variant toggles read the same graph through different lenses (vulnerability, flow, minimal claim set, politicization).

How to read this graph

Every node in the lit review collapses to one of eight types. Edges between them carry one of seven relations. Together they make the structure inspectable.

Node types

Code	Type	What it is
A	Foundational assumption	A claim the field cannot operate without; if false, large downstream regions collapse
M	Methodological prerequisite	A study design or estimation tool that must work for the empirical claims to be testable
E	Empirical claim	A specific measured finding with an effect size and replication status
L	Logical necessity	Follows from definitions or algebra; not empirically refutable
G	Generating mechanism	A causal process that explains a pattern (rGE, AM, niche-picking, critical periods)
S	Synthesis claim	An integrative statement combining multiple lower-level claims
O	Open question	Genuinely undecided with current methods or evidence
D	Distortion vector	Where motivated reasoning concentrates (typed by direction)

Edge types

Code	Edge	Meaning
dep	depends-on	If target collapses, source collapses
imp	implies	Logical implication
sup	empirically-supports	Evidence relation
conf	confounds / inflates	Artifact relationship (e.g., AM inflates rg)
mod	moderates	Changes magnitude (e.g., SES × heritability)
dev	develops-into	Temporal/developmental successor (temperament → personality)
corr	corrects	Within-family corrects between-family bias

Weight scale (load-bearing weight, 1–5)

5 — crux node; collapse propagates across multiple sections of the lit review
4 — load-bearing within a section
3 — important but local
2 — corroborating
1 — decorative; could be removed without changing the picture

1. Node catalog

Each node carries: type code · weight · short claim · key citation · status. Status flags: ✓ (robust/replicated), ~ (partial/qualified), ? (contested), ✗ (refuted, kept as historical reference).

A — Foundational assumptions

ID	Wt	Claim	Status
A1	5	Twin/adoption methods provide approximately valid variance decomposition (EEA modestly violated but not fatally)	✓
A2	5	GWAS signal reflects real genetic effects, not (only) population stratification or AM artifact	✓ partial
A3	5	A general factor of cognitive ability `g` is a real dimension of individual variation (positive manifold)	✓ statistical / ? mechanism
A4	3	Heritability findings apply to the population sampled, not to individuals or other populations (scope)	✓
A5	3	Phenotypes are reliably and validly measurable across cultures and time	~
A6	3	Most psychological variation is dimensional, not taxonic	✓

M — Methodological prerequisites

ID	Wt	Tool	Notes
M1	4	Twin studies (MZ/DZ) at scale	Polderman 2015 meta: 14.5M pairs
M2	4	Adoption studies, especially cross-cultural (Korean-American)	Sacerdote 2007; Beauchamp 2023
M3	5	GWAS at N ≥ 100k (ideally ≥ 1M for personality/EA)	Okbay 2022 (3M for EA)
M4	5	Within-family designs (sibling FE, MZ-discordant, parent-offspring trios). Kong 2018; Okbay 2022 (EA, N=3M); Howe et al. 2022 Nature Genetics extended this to 178k siblings × 25 phenotypes — within-sibship estimates were systematically smaller than population estimates for height, EA, cognitive ability, depressive symptoms, smoking. The within-family approach is now mature beyond just educational attainment	✓
M5	4	Polygenic scores (PGS)	Best R² ~0.16 for EA, ~0.10 for SCZ
M6	3	Cross-trait LD-score regression for genetic correlations	Brainstorm 2018
M7	3	Mendelian randomization	For causal inference from observational data
M8	3	Pre-registration & collaborative meta-analysis	Demolished candidate-GxE
M9	4	Whole-genome sequencing (rare-variant capture). 2025 follow-up (UK Biobank ~500k, Wainschtein et al. 2025 Nature) captures ~88% of pedigree-based narrow-sense heritability across many traits (20% rare + 68% common variants). The “missing heritability” problem is now substantially resolved for many phenotypes	✓

E — Empirical claims

Cognition / IQ:

ID	Wt	Claim	Status
E1	5	Mean trait heritability ≈ 0.49 across 17,804 traits (Polderman 2015)	✓
E2	5	Shared environment C ≈ 0 for adult personality and most adult cognition	✓ with exceptions (EA, religiosity, politics)
E3	4	Wilson Effect: IQ heritability rises from ~0.20 (age 5) to ~0.80 (adulthood)	✓
E4	5	Hyper-polygenic architecture: thousands of small-effect variants	✓ (Turkheimer’s 4th Law, Chabris 2015)
E5	4	Candidate-gene approach for psychiatric/personality traits failed (5-HTTLPR etc.)	✗ original claims; ✓ collapse finding
E6	5	Within-family PGS effects are ~½ population-level effects (genetic nurture)	✓ for EA, BMI, height
E7	5	Cross-trait assortative mating explains R²≈74% of variance in genetic correlation estimates	✓ (Border 2022, Science)
E8	4	Lead exposure 1–10 µg/dL → −6.2 IQ pts (Lanphear 2005)	✓
E9	4	Each year of schooling adds ~3.4 IQ points, persisting into old age	✓ (Ritchie & Tucker-Drob 2018)
E22	4	Within-population heritability does not license between-population inference	✓ logical
E23	4	PGS prediction accuracy decays continuously along the genetic-distance continuum from training population (Pearson r = −0.95 between genetic distance and PGS accuracy across 84 traits, Ding et al. 2023, Nature). Reframes the older “discrete ancestry-group drop” picture (Martin 2019; Mostafavi 2020)	✓
E24	3	Flynn Effect and its post-1990s reversal are both environmentally driven (within-sibship evidence)	✓ pattern; ? cause
E25	2	Scarr-Rowe: SES × heritability hypothesis (more genetic expression in higher-SES). Weakening further in 2024 — Ghirardi et al. 2024 found 39/42 PGI×SES interactions in education NEGATIVE (compensatory direction); only 1 significant positive. Pattern is now closer to “compensatory hypothesis holds, Scarr-Rowe fails” than to “context-dependent”	✗
E18	4	Positive manifold: every cognitive test correlates positively with every other	✓
E26	3	Childhood IQ → all-cause mortality: each 1-SD ≈ 24% lower mortality	✓ (Calvin 2011)
E27	3	Lifespan IQ stability: Lothian Birth Cohort age-11 → age-90 r ≈ 0.67	✓
E28	3	Severe iodine/alcohol/deprivation cause large asymmetric IQ effects	✓

Personality / temperament:

ID	Wt	Claim	Status
E29	4	Big Five h² ≈ 0.40–0.60; cross-cultural replication for E/A/C	✓ partial (Tsimane qualifier)
E30	4	Cumulative continuity: rank-order stability rises to ~0.74 by midlife	✓
E31	4	Maturity principle: mean-level ↑ in C, A, ES with age	✓
E32	3	Temperament dimensions (Surgency, Negative Affectivity, Effortful Control) → adult personality	✓
E33	3	Personality predicts mortality/divorce/income at magnitudes ≈ SES & cognition	✓ (Roberts 2007)

Sex differences:

ID	Wt	Claim	Status
E10	4	Multivariate Big Five sex difference: D ≈ 2.71 (~10% overlap)	~ method-sensitive
E11	4	People-things interest difference d ≈ 0.93 (largest in psychology)	✓
E12	3	Mental rotation d ≈ 0.56–0.73 male advantage	✓
E13	3	Math performance d ≈ 0.05–0.10 (essentially equal)	✓
E14	4	Gender Equality Paradox: differences larger in egalitarian/wealthier countries. Herlitz et al. 2025 systematic review (54 articles, 27 meta-analyses) confirmed the pattern across personality, verbal abilities, episodic memory, and negative emotions — pattern replication has strengthened, not weakened	✓ pattern; ? mechanism
E34	3	Physical aggression d ≈ 0.40–0.60 male; ~95% of homicides male	✓
E35	2	CAH girls show masculinized toy preferences; primate parallels	✓

Psychopathology:

ID	Wt	Claim	Status
E15	4	All major psychiatric disorders highly heritable (h² 0.35–0.85) and hyper-polygenic	✓
E16	4	Cross-disorder genetic correlations exist	✓ existence; ? magnitude post-AM
E17	3	A `p` factor (general psychopathology) fits cross-syndrome data	✓ statistical; ? interpretation
E36	3	Autism: common-PGS positively correlated with IQ; rare/de-novo drives ID-comorbid cases	✓
E37	2	Critical-period plasticity (GABAergic, perineuronal nets) is mechanistically real	✓

L — Logical necessities

ID	Wt	Claim
L1	5	Heritability is a population variance ratio; it does not partition individual phenotypes (mathematical form — A4 is the scope-of-claim sibling)
L2	4	h² changes with environmental variance: hold genes constant, equalize environments → h² → 1
L3	5	Within-family designs control for between-family confounds (rGE, stratification, AM)
L4	5	Within-population heritability provides no information about between-population mean differences (Lewontin). E22 in the empirical column is the applied form of this same point
L5	3	Multivariate D ≥ max(univariate d) when component dimensions are positively correlated
L6	3	Positive manifold permits both unitary-cause and emergent-network interpretations of g
L7	3	Effect-size interpretation is scale-dependent (d=0.10 trivial in trait psychology, large in clinical)

G — Generating mechanisms

ID	Wt	Mechanism	Drives
G1	4	Active rGE / niche-picking	E3 (Wilson Effect amplification)
G2	5	Passive rGE. Wang 2021 / Isungset 2022 confirm indirect ≈ ½ direct genetic effect for EA. Nivard et al. 2024 found indirect genetic effects on offspring achievement extend beyond the nuclear family — dynastic / extended-family / community processes contribute, so the “parents transmit gene + correlated environment” framing understates the spread	E6 (genetic nurture); inflation of population-level h²
G3	3	Evocative rGE	Heritability of “environments” (Kendler & Baker 2007)
G4	3	Critical-period plasticity (GABAergic maturation)	Asymmetric environmental effects on early development
G5	4	Assortative mating → LD induction	Inflates additive genetic variance, h²
G6	5	Cross-trait AM → spurious genetic correlations	Confounds E16, E17 (p-factor)
G7	4	Stochastic developmental noise	Dominant source of non-shared environment
G8	3	Selection / niche construction across the lifespan	Bridges temperament → personality → outcome cascade

S — Synthesis claims

ID	Wt	Claim
S1	5	”Genes vs. environment” is the wrong frame; the system is tightly coupled (genome × rGE × AM × few large environmental insults × stochastic noise × culture × developmental unfolding)
S2	5	Twin h² ≥ SNP h² ≥ within-family h² gradient quantifies AM/rGE/measurement inflation across estimation methods
S3	4	Heritability ≠ destiny; high h² is compatible with large environmental shifts (height: h² ≈ 0.80, +10cm in a century)
S4	4	Most “non-shared environment” is stochastic, not systematic — it accounts for ~50% of personality variance and is poorly characterized
S5	4	Two parallel hierarchies (CHC for cognition, HiTOP for psychopathology) connected at the top by inverse g↔p genetic correlation
S6	4	Developmental cascade: temperament (infant biological reactivity) → personality (adult social-cognitive layer added) → outcomes (mortality, attainment, relationships) with h² ↑ and shared-env ↓ across the lifespan

O — Open questions

ID	Wt	Question	Why it matters
O1	5	Mechanistic interpretation of PGS: “causal genetic” (Plomin) vs. “weak explanation” (Turkheimer)	Determines what PGS prediction means
O2	3	Cause of Flynn-effect reversal post-1990s	Empirical pattern robust (Bratsberg & Rogeberg 2018 within-sibship Norway). Mechanism still unsettled. Pietschnig et al. 2024 (Vienna 2005–2018 cohort) added a wrinkle: the positive manifold itself may be weakening — gains in some abilities aren’t tracking gains in others, suggesting the g-loading of the rise/fall is not constant. Hypothesized mechanisms (screens, reduced long-form reading, attention) circulate without empirical pinning
O3	4	Causal mechanism behind Gender Equality Paradox	Innate-expression release vs. measurement artifact vs. confound — selection of explanation has political stakes
O4	5	Between-population mean differences: any genetic component?	Currently scientifically unanswerable with available methods (PGS portability too poor, cross-ancestry GWAS at scale don’t exist). Honest position: unresolved, not settled in either direction
O5	3	g architecture: latent common cause vs. emergent network (mutualism, van der Maas 2006)	Affects how interventions could in principle move g
O6	4	What “non-shared environment” actually is: stochastic noise, immune/microbial, peer networks, epigenetic, measurement error	Largest unmodeled variance component in personality
O7	5	Magnitude of AM-correction across the cross-disorder genetic correlation matrix	Active revision. Ma, Wang, Border et al. 2024 AJHG introduced LAVA-Knock — a local-genetic-correlation method that reduces xAM-induced bias. Methods to give the answer are now emerging, not just to flag the problem

D — Distortion vectors (where motivated reasoning concentrates)

ID	Direction	Targets	Failure mode
D1	Blank-slate / environmentalist	A1, E1, E10–E14	Dismiss twin studies wholesale; oversell transgenerational epigenetics; overstate stereotype threat; minimize sex differences via univariate-only framing
D2	Hereditarian	L4, E22, E23, O4	Ignore Lewontin; treat g-loadedness of gaps as evidence of genetic etiology; cite fringe admixture studies; ignore AM/rGE corrections to PGS
D3	”Gender similarities” minimization	E10, E11, E14	Selective citation (math d=0.05) to imply no differences anywhere; obscure D=2.71 multivariate; minimize d=0.93 interest gap
D4	Pop evpsych overgeneralization	E10–E14, A6	Treat dimensional ds as taxonic; extrapolate small ds to categorical claims; overgeneralize from specific tasks to broad-domain claims

2. Dependency cascade

The cascade reads from foundations up to synthesis, and from corrections back down to corrected claims.

Forward cascade (foundations → empirical claims → synthesis)

A1 ──dep──> M1, M2 ──sup──> E1, E2, E3, E25, E29
A2 ──dep──> M3, M5 ──sup──> E4, E6, E7, E15–E17, E22–E23
A3 ──dep──> E18 ──sup──> E26, E27 ──imp──> S5
A4 (scope) + L1 (form) ──guards──> interpretation of E1, E2, E3 and S3
A6 ──imp──> S5 (dimensional turn in psychiatry)
M9 ──corr──> E1 (closes missing-heritability gap)
M3 + M4 ──sup──> E6 (genetic nurture), E7 (xAM)
E5 (candidate-gene collapse) ──imp──> E4 (polygenic architecture confirmed by absence of large hits)

E1 + E2 + E3 + G1 ──imp──> S6 (developmental cascade)
E1 + E4 + G2 + G5 ──imp──> S2 (h² gradient by method)
E10 + E11 + E12 + E13 + L5 ──imp──> "small univariate, large multivariate" sex-difference picture
E14 + O3 ──imp──> mechanism-pending GEP
E15 + E16 + E17 ──imp──> S5 (HiTOP/p)
E22 + E23 + L4 ──imp──> O4 (between-pop unanswerable currently)
E1 + E2 + E4 + E6 + E7 + E8 + E9 + G1–G7 ──imp──> S1, S2 (integrated picture)

Backward / corrective cascade (newer evidence revises older claims)

G2 (passive rGE) ──corr──> E1 estimates (population-level overstates direct genetic)
G6 (cross-trait AM) ──corr──> E16 (some psychiatric rg's may be xAM artifact)
M4 (within-family) ──corr──> E6 magnitude (~½ of population PGS)
M8 (preregistration) ──corr──> E5 (collapsed candidate-GxE)
M9 (WGS) ──corr──> "missing heritability" interpretation

Distortion → target edges

D1 ──attacks──> A1, E1, E10, E11, E12, E14
D2 ──attacks──> L4, E23 (ignores), exploits A2 absent corrections from G2/G6
D3 ──attacks──> E10, E11 (selective univariate framing)
D4 ──attacks──> A6, L7

3. Where pressure concentrates

A common failure mode in this literature is to treat all high-stakes nodes as the same type of thing. They are not. The graph has three distinct categories of high-stakes node and one category of fragile claim — keeping these separate sharpens what the field actually needs to resolve.

3a. Foundational cruxes — falsification breaks regions of the picture

These are the empirical-or-methodological assumptions that, if wrong, force rebuilding large parts of the lit review.

A1 — Twin/adoption method validity. Carries Section 1 of the lit review; heritability-by-domain table; Wilson Effect. Robustness: HIGH (MZ-reared-apart, SNP-h² bypassing EEA, misperceived-zygosity all converge). Would flip if SNP-h² for psychological traits systematically converged on <0.05 — has not occurred.

A2 — GWAS signal is real (not artifact). Carries the PGS enterprise; genetic nurture estimates; cross-disorder pleiotropy; modern psychiatric genetics. Robustness: MODERATE-HIGH (within-family PGS effects are non-zero for EA, BMI, height — direct signal exists; AM/stratification inflation magnitudes still being quantified). Would flip if within-family PGS effects converged on zero across most traits.

A3 — g is a real dimension of cognitive variation. Carries Section 5 of lit review; predictive-validity claims; CHC structure; mortality/income predictions. Robustness: HIGH for g as a statistical regularity; MODERATE for g as unitary biological mechanism. Would flip if a broad cognitive battery had first-PC <15% or if interventions reliably moved one ability while lowering others. 2024 wrinkle: Pietschnig et al. 2024 reported the positive manifold may be weakening across recent cohorts — softly pressures A3 in a new way without refuting it.

3b. Reframer nodes — the answer is open and reshapes interpretation

These don’t break the picture if reversed; they change what the picture means. Their magnitudes are being actively quantified in 2024–2026 work. Conflating reframers with foundational cruxes is the most common conceptual error in pop-science treatments of this field.

G2 / E6 — Passive rGE / genetic nurture. Reframes the meaning of every population-level genetic estimate. Without G2, “genetic transmission” reads as direct biological causation; with G2, ~half is environmentally mediated by genetically-similar parents (Wang 2021 / Isungset 2022). Nivard et al. 2024 (Nat Hum Behav) showed indirect genetic effects extend beyond the nuclear family to dynastic / extended-family processes. The existence is robust; precise magnitude across all traits is still being quantified.

G6 / E7 — Cross-trait assortative mating. Reframes the cross-disorder genetic-correlation matrix and the p-factor’s interpretation. Border 2022 (Science) showed phenotypic cross-mate correlations explain R²=74% of variance in genetic-correlation estimates. Ma, Wang, Border et al. 2024 (LAVA-Knock) is the first method to systematically reduce xAM-induced bias. The share of any specific rg that is artifact vs. genuine pleiotropy is still pending.

3c. Logical guardrails — unfalsifiable but load-bearing for interpretation

These cannot be falsified — they are algebraic / definitional truths. They can be ignored, which is how most public-discourse misuse of the field happens.

L1 — Heritability is a population variance ratio, not an individual partition. Cannot be falsified. Public misreading of “70% heritable IQ” as “70% of any individual’s IQ comes from genes” is the failure of L1, not the science.

L4 — Within-population heritability does not license between-population mean inference (Lewontin firewall). Cannot be falsified — it is a logical/algebraic point. Can only be ignored. The empirical buttress today is E23 (PGS portability collapse along genetic-distance continuum, Ding 2023): even if you wanted to use within-pop methods to speak to between-pop differences, the methods don’t currently work.

3d. Decorative material (safe to compress)

Removable from the topology without changing the qualitative picture:

E35 (CAH / primate toy preferences) — convergent evidence, not necessary
E37 (specific GABAergic critical-period mechanisms) — biologically real, not load-bearing for the variation argument
HEXACO Honesty-Humility specifics — incremental over Big Five
Specific Dark-Triad subdimensions — D-factor synthesis (Moshagen 2018) carries more weight
P-FIT brain network specifics — corroborate g but don’t establish it
Yehuda Holocaust FKBP5 transgenerational findings — refuted/non-replicated; kept only as historical anchor for D1 distortion
Specific candidate-gene findings (5-HTTLPR depression) — refuted; kept as historical anchor for the field’s methodological turn (M8)

4. Weakest links

These are the load-bearing pieces with the lowest current confidence. Targeted attack on any one would do the most damage to the integrated picture.

W1: Generalization from candidate-GxE failure to “all GxE is small” (E5 → broader claim)

Why fragile: The candidate-gene collapse is definitive. The extrapolation that polygenic-score × environment interactions are also small is an inductive leap, not a result. As of 2025, the picture is partially holding the null but not strengthening it. A 2025 systematic review of 56 PGS×E studies for depression found mostly null or small effects. A multivariable PGS×E study of educational achievement (Allegrini et al. 2020) found “no evidence that GxE effects significantly contributed to multivariable prediction.” UK Biobank work (2024) on distinct explanations of GxE shows that many apparent GxE signals are confounded by scale, ascertainment, or population structure. The candidate-gene-failure extrapolation is looking less like an inductive leap and more like a substantive empirical pattern — but the literature is still too young for a strong null.

Pressure test: Several large preregistered PGS×E studies finding interactions explaining >5% variance would substantially revise this corner of the picture.

W2: Scarr-Rowe (E25) — has substantially weakened since pass-0

Why fragile: The original meta-analytic picture was “replicates in US, fails in W. Europe / Australia” (Tucker-Drob & Bates 2016). Ghirardi et al. 2024 (Netherlands Twin Register, polygenic-index design across 42 PGI×SES interaction tests for educational outcomes) found 39/42 negative, 0 significant positive, 1 marginally significant positive — i.e., the opposite sign from Scarr-Rowe in most cases. The picture in 2026 is closer to “the compensatory hypothesis (more genetic expression in low-SES because constrained environments suppress non-genetic variance) is the better-supported pattern, at least for educational outcomes.” E25’s weight has been downgraded from 3 → 2 to reflect this. The narrative “deprivation suppresses heritability” — popular in policy discourse — is now evidence-thin.

W3: Plomin-vs-Turkheimer interpretation of PGS (O1)

Why fragile: Both views are compatible with current data. Determines what PGS means — direct biology vs. summary statistic of correlated environments. The field publishes ambiguously across both interpretations. Will likely be settled only by within-family-only PGS that are still well-powered.

W4: Magnitude of AM-correction across psychiatric cross-disorder rg matrix (O7) — methods now emerging

Why still fragile but improving: Border 2022 showed xAM explains R²=74% of variance in genetic-correlation estimates but didn’t prove all rg’s are spurious — some genuine pleiotropy surely exists. As of 2024, the field is moving from flagging the problem to building correction methods. Ma, Wang, Border et al. 2024 (American Journal of Human Genetics) introduced LAVA-Knock, a local-genetic-correlation method using knockoff inference to reduce xAM-induced bias; tested across 630 trait pairs in simulation and real GWAS, it substantially reduces but does not eliminate the bias. A 2024 study found AM genetic signatures across SCZ, BD, MDD, alcohol phenotypes, and Tourette syndrome — confirming xAM is not selective. What’s still pending: how much of the cross-disorder rg matrix and the p-factor genetic signal survives systematic application of AM-correction methods at scale. Likely answer in 2–3 years.

W5: Gender Equality Paradox mechanism (O3, E14) — pattern strengthened, mechanism still contested

Empirical pattern: more robust as of 2025. Herlitz et al. 2025 systematic review (54 articles, 27 meta-analyses, Perspectives on Psychological Science) found the paradox replicates across personality, verbal abilities, episodic memory, and negative emotions. Balducci et al. 2024 extended it to within-individual academic strengths cross-temporally. The “this won’t replicate” objection has weakened.

Mechanism: still contested. Three live candidates: (a) innate-expression release in resource-rich environments, (b) reference-group / self-anchoring artifacts in self-report (people compare to their gender peers, not to humans-in-general), (c) wealth/freedom confounds with gender equality. Behavioral / incentivized-measure replications (Falk & Hermle 2018 for economic preferences) cover only part of the domain. The decisive test — non-self-report behavioral replication across personality and interests — is still incomplete. Each candidate mechanism implies different normative conclusions, which is part of why this remains contested rather than resolved.

W6: Flynn-reversal cause (O2) — and a new wrinkle on the positive manifold

Why fragile: The pattern is environmentally driven (within-sibship, Bratsberg & Rogeberg 2018), so “dysgenic” explanations are out. But no mechanism (screen time, education quality, attention, nutrition, lead, microplastics) has been pinned down with within-cohort empirical work. Pietschnig et al. 2024 (Vienna 2005–2018 cohort) added a structural twist: the positive manifold itself may be weakening across cohorts — meaning the recent rise/fall is not uniformly g-loaded. If confirmed broadly, this softly pressures A3 (g exists as a stable dimension) — not refuting it, but suggesting its strength may be cohort-dependent. Still not load-bearing for the integrated picture, but interacts with A3 in a new way.

W7: A6 (dimensional vs. taxonic) at psychiatric extremes

Why fragile: Most psychopathology is dimensional (taxometric evidence is robust), but for severe early-onset autism with intellectual disability, rare large-effect variants (CHD8, SCN2A, SYNGAP1) drive a partly taxonic picture. The “all dimensional” framing oversells continuity at the severe tail.

5. Variant views

The same graph, read four ways.

Variant A: Vulnerability map — where does this break?

The vulnerability map is the union of the three foundational cruxes (§3a), two reframer nodes (§3b), two logical guardrails (§3c), and seven weakest links (§4). Together they describe the smallest set of pressure points whose movement would force restructuring of the integrated picture:

Falsify A1: SNP-h² systematically <0.05 → twin-method discredited → Section 1 collapses
Falsify A2: within-family PGS → 0 → modern psychiatric genetics collapses
Falsify A3: positive manifold dissolves → Section 5 collapses
Falsify G2: within-family PGS = population PGS → genetic nurture is null → Plomin direct-causal view wins (O1 resolves)
Falsify G6 fully: AM correction barely changes rg matrix → cross-disorder pleiotropy is real
Violate L4: cannot be falsified, only ignored — but its violation in public discourse is the largest single source of public confusion

If exactly one of these were to flip, the rebuild would be: A1→ rebuild Section 1 only; A2→ rebuild Sections 1, 3, 7 (~40% of lit review); A3→ rebuild Section 5 (~25%); G2/G6→ keep numbers, rewrite causal interpretation throughout.

Variant B: Flow map — how does causation propagate?

Causation in this system runs in two directions, both important.

Forward developmental flow (genome → outcomes):

Genome (polygenic + few rare large-effect)
    │
    ├──> Temperament (infant biological reactivity: Surgency / NA / EC)
    │       │
    │       ├──> Active rGE / niche-picking ─────────┐
    │       │                                        │
    │       └──> Evocative rGE (eliciting responses) ┤
    │                                                │
    └──> Direct expression in brain development ─────┤
                                                     ▼
                                                Personality (adult)
                                                     │
                                                     ├──> Attainment
                                                     ├──> Relationships
                                                     ├──> Health behaviors
                                                     └──> Mortality

Indirect / dynastic flow (parents’ genome → offspring environment → offspring outcome):

Parents' genome
    │
    ├──> Parents' phenotype (income, vocabulary, parenting style, neighborhood choice)
    │       │
    │       └──> Offspring's rearing environment ────────┐
    │                                                    ▼
    │                                              Offspring outcomes
    │                                                    ▲
    └──> Transmitted alleles ───────────────────────────┘

The genetic-nurture finding (E6) says these two pathways have roughly equal magnitude for educational attainment. They are partially separable only via within-family designs (M4) or non-transmitted-allele PGS.

Cross-generational drift via assortative mating:

Mating choice (correlated on phenotype) 
    │
    └──> LD induction among causal variants (G5)
              │
              ├──> Inflated additive genetic variance
              ├──> Inflated h²
              ├──> Inflated cross-trait genetic correlations (G6)
              └──> Inflated PGS prediction accuracy

Variant C: Minimal claim set — smallest set supporting the conclusion

The smallest collection of claims that yields the integrated picture (S1) is eight nodes:

E1 — Mean trait h² ≈ 0.49 (heritability is real and substantial)
E4 — Polygenic architecture (no master genes)
E6 — Within-family PGS ≈ ½ population PGS (genetic nurture is real)
E7 — Cross-trait AM is a major source of inflated genetic correlations
E8 — A small set of large environmental insults have causal effects (lead, iodine, alcohol, deprivation, schooling)
L4 — Within-pop ≠ between-pop (Lewontin)
G7 — Stochastic developmental noise is the dominant source of non-shared environment
A6 — Most psychological variation is dimensional, not taxonic

These eight together generate the qualitative integrated picture without requiring detailed effect-size tables, cross-cultural caveats, or specific candidate-gene history. The remaining ~50 nodes refine and corroborate but do not change the shape.

Variant D: Politicization map — where does motivated reasoning concentrate?

This is the variant most relevant to the topic framing (“a minefield of motivated reasoning on all sides”).

Distortion-to-target matrix:

Distortion	Targets	Move	Counter-evidence
D1 Blank-slate	A1, E1, E10–E14	”Twin studies are flawed; differences are socialization”	SNP-h² (bypasses EEA), MZ-reared-apart, Su 2009 (d=0.93), CAH/primate convergence
D2 Hereditarian	L4, E22, E23, O4	”Group differences are genetic”	PGS portability collapse (E23); Lewontin (L4); cross-ancestry GWAS at scale don’t exist
D3 Gender-similarities	E10, E11, E14	”All differences are tiny (cite math d=0.05)“	Multivariate D=2.71 (E10); people-things d=0.93 (E11); GEP (E14)
D4 Pop-evpsych	A6, L7, E10–E14	”Men are X, women are Y” (categorical from dimensional)	A6 (dimensional); L7 (effect-size context)

Why all four distortions can target the same evidence base: the evidence base contains both large differences (people-things d=0.93) and trivial ones (math d=0.05) and strong heritability (h²=0.49) and large environmental insults (lead, schooling) and logical guardrails against between-group inference (L4). Any single-direction narrative requires selective citation. The integrated picture (S1) requires holding all of it at once.

Operational implication for the formalization stage: any model that only parameterizes the variance components without parameterizing the interpretation of those components will be silently captured by whichever distortion the reader is most prone to. The formal model needs to make L4, G2, G6, and the dimensional/taxonic distinction (A6) structurally visible, not just numerically present.

6. Topology → formalization handoff

What the next stage (model formalization) should pick up.

Ready for equations

Variance decomposition — fully specifiable now:

V(P) = V(A_direct) + V(A_indirect) + V(A_AM-LD) + V(C_residual) + V(E_measured) + V(E_stochastic) + 2·Cov(G,E) + V(GxE)

With each V parameterized by trait, age, population (US vs. Europe for E25), and method (twin / SNP / within-family). Cov(G,E) captures rGE; V(A_AM-LD) captures G5/G6; V(A_indirect) captures G2.
Method gradient (S2): twin h² ≥ SNP h² ≥ within-family h², with the gaps decomposable into AM, rGE, and rare-variant contributions. Parameterize as a function of estimation method.
Wilson-effect curve: h²(age) = a + b·log(age) or similar saturation form, with the slope driven by G1 (active rGE). Calibratable from Bouchard 2013 and Briley & Tucker-Drob 2013.
Multivariate sex-difference algebra: D² = (μ₁ - μ₂)ᵀ Σ⁻¹ (μ₁ - μ₂), with a worked example showing how D = 2.71 follows from moderate univariate ds and a positive-correlation covariance structure.
PGS-portability decay function: prediction accuracy as a continuous function of genetic distance from training population (Ding et al. 2023, Nature: r = −0.95 between genetic distance and accuracy across 84 traits).

Still at observation stage (formalization premature)

O1 — Plomin/Turkheimer interpretation of PGS: not yet a formal disagreement, just a verbal one
O3 — GEP mechanism: the algebra of “innate expression release” is not yet specified
O6 — what non-shared environment is: no candidate decomposition
O7 — share of cross-disorder rg that survives AM correction: empirical question pending — but methods are now emerging (LAVA-Knock); answer likely in 2–3 years, at which point this moves to “ready for equations”

Connection to adjacent topics in the LLM-iterate pipeline

This topology is the natural input to Parent-to-Child Transmission (planned topic). The genetic-nurture finding (G2/E6) and the dynastic-extension finding (Nivard 2024) are the empirical answers to “how much does parenting matter beyond genes” that the parent-child topic will need to build on. When that topic spins up, the variance decomposition equation here should be its starting point.

Less directly: the Evolution-Modernity Mismatch topic will lean on the GEP (E14/O3) and Flynn-reversal (O2) findings as evidence of environment-driven shifts in expressed psychological variation. Bedrock Generating Functions can read the variance decomposition itself as one such bedrock function.

7. Next moves — three options for Stage 3

The user picks one of these as the primary formalization target. Each leaves the others viable as later modules but shapes Stage 4 (data) differently.

Option A — Variance decomposition + method gradient (most central)

Build the central equation V(P) = V(A_direct) + V(A_indirect) + V(A_AM-LD) + V(C) + V(E_meas) + V(E_stoch) + 2·Cov(G,E) + V(GxE) parameterized by trait, age, population, and estimation method. Build a tool that takes a published h² estimate (twin, SNP, or within-family) and outputs a method-corrected estimate with explicit AM/rGE adjustment.

Pros: most central to the topic; directly answers “what generates psychological variation”; feeds Stage 4 cleanly (every term has published estimates somewhere). Cons: many parameters; risk of producing a calculator nobody uses without strong UI judgment. Stage 4 implication: pull h² estimates from PGC, SSGAC, GIANT consortia; calibrate the method-gradient term per trait class.

Option B — Multivariate sex-difference algebra (most pedagogically clean)

Formalize how moderate univariate Cohen’s ds combine into a large multivariate Mahalanobis D, with a worked Big-Five example showing how D ≈ 2.71 emerges from |d| ≈ 0.4 ds and a positive-correlation Σ. Build a dashboard letting the user dial univariate ds and the correlation matrix to see D move.

Pros: tightly scoped; resolves the single biggest framing trap in the GEP debate (univariate vs. multivariate framings of the same data); high pedagogical leverage. Cons: narrower than A; doesn’t engage the heritability core. Stage 4 implication: pull effect-size matrices from Del Giudice 2012 and Schmitt 2008 cross-cultural data; replicate D under different correlation structures.

Option C — PGS-portability calibration (most practically useful)

Turn Ding et al. 2023’s continuous decay finding into a usable accuracy estimator: enter an individual’s genetic distance from the PGS training population and get an accuracy-decay multiplier. Apply across the major trait PGSs (EA, SCZ, BMI, etc.).

Pros: directly addresses a real-world bias; smallest scope; ships fastest; useful even outside this project’s domain. Cons: less central to the heritability question; might fit better as a tool than a topic-stage. Stage 4 implication: pull cross-ancestry GWAS validation data from the All of Us / GenomeAsia / H3Africa consortia.

My recommendation: A as primary (most central to the topic’s stated purpose), with B as a stretch module if scope allows. C is high-value but might better live as a standalone tool promoted to /models later.

8. Objections to this topology (adversarial + steelman)

Four ways a careful reader could push back. The strongest version of each, then my response.

Objection 1 — Discrete typed edges falsify a continuous, magnitude-weighted, context-conditional system

Heritability is not “supported by” a twin study in the same binary way that a logical implication holds. The system is a tightly coupled developmental process; flattening it into nodes-and-arrows with discrete edge types loses information about magnitude, conditional dependence, and gradient relationships.

Response: Acknowledged, and intentional. The topology is the qualitative skeleton; edge weights and conditional dependencies are the job of Stage 3 (formalization), where each edge will be turned into a parameterized function. The graph’s value is not that it stands in for the full system but that it makes the structure visible cheaply enough that the formalization knows where to put the parameters.

Objection 2 — The crux/decorative split is editorial, not empirical

There is no algorithm that picks crux nodes; the choice depends on which failure modes you are worried about. A 1990s topology of this field would have crowned candidate-gene findings as cruxes. Naming A2 (GWAS signal real) a crux today is a judgment call about the field’s current methodological commitments — not an objective feature of the science.

Response: Correct. Cruxes are time-stamped. This topology is a 2026 snapshot. If the field shifts (post-AM-correction era, post-within-family-PGS-at-scale era) the crux set will shift — that is what the refinement passes are for. Use this as a current map, not an immutable structural claim.

Objection 3 — Calling L4 a “logical firewall” overstates the case

Lewontin’s 1970 argument has been challenged. Edwards (2003) “Lewontin’s Fallacy” showed that Lewontin’s specific quantitative point — that ~85% of human genetic variance is within rather than between populations — does not preclude reliable population-classification from genetic markers. Modern population genetics treats between-population genetic inference as more nuanced than the firewall framing suggests.

Response: The Edwards critique is real, but it addresses a different claim. Edwards refuted “you cannot reliably classify individuals into populations from genetic data.” The L4 firewall as I formulate it says “within-population heritability provides no information about between-population mean differences without strong auxiliary assumptions about shared causal architecture and equal environments.” Those are different propositions. PGS portability collapse (E23) is the contemporary empirical evidence that the auxiliary assumptions are not currently being met for psychological traits. The firewall framing survives the Edwards critique; its strength rests on the empirical PGS-portability finding, not on the original Lewontin variance argument alone.

Objection 4 — The Politicization variant is meta-commentary, not topology

The D nodes and attacks edges describe how people misuse the evidence base. That is epistemics or sociology of science, not structural topology of the field. A pure topology should omit them.

Response: Fair, and the inclusion is non-orthodox. It is justified here only by the topic framing — the user’s prompt explicitly described the field as “a minefield of motivated reasoning … where the actual generating functions are obscured by politics.” A topology of just the science would omit the D nodes; a topology that helps a reader navigate the field as it is actually encountered should include them. The D nodes will not be carried into Stage 3 formalization — they exist for navigation, not for downstream computation.

9. Glossary

For readers approaching this from outside the field. Terms appear throughout the lit review and topology; this is the lookup table.

Term	Meaning
h²	Heritability — fraction of trait variance in a population attributable to genetic variation. A population statistic, not an individual one.
SNP	Single-nucleotide polymorphism — a single-base difference at a position in the genome where multiple variants exist in the population.
GWAS	Genome-wide association study — scans hundreds of thousands of SNPs against a measured trait, looking for statistical association.
PGS	Polygenic score — a per-individual sum of trait-associated SNPs weighted by their GWAS effect sizes. Used as a predictor.
LD	Linkage disequilibrium — non-random association between alleles at nearby loci, typically because they are inherited together.
AM	Assortative mating — partners resemble each other on a trait above chance. xAM = cross-trait AM (e.g., taller-than-average partners with more-educated-than-average).
rGE	Gene-environment correlation. Passive (parents transmit genes + correlated environment), evocative (heritable traits elicit responses), active (people select environments matching propensities).
GxE	Gene-environment interaction — the same genotype produces different phenotypes in different environments.
EEA	Equal environments assumption — the twin-method assumption that MZ and DZ twins are treated similarly enough that any extra MZ phenotypic resemblance reflects genetics, not differential treatment.
MZ / DZ	Monozygotic (identical, ~100% shared DNA) / dizygotic (fraternal, ~50% shared DNA) twins.
rg	Genetic correlation between two traits — how much the same genetic variants influence both.
WGS	Whole-genome sequencing — capturing every base in the genome, including rare variants GWAS misses.
g-factor	General factor of cognitive ability — the latent dimension behind the positive manifold (every cognitive test correlates positively with every other).
p-factor	Proposed general factor of psychopathology — analogous to g, derived from cross-syndrome correlations.
CHC / HiTOP	Cattell-Horn-Carroll cognitive-ability hierarchy / Hierarchical Taxonomy of Psychopathology (a dimensional alternative to DSM).
d (Cohen’s d)	Standardized mean difference between two groups, in standard-deviation units. Effect-size labels (small / medium / large) are scale-dependent — see L7 in the node catalog.
Mahalanobis D	Multivariate generalization of Cohen’s d — distance between two group means in the geometry of the trait space, accounting for correlation between traits.
Within-family design	Comparing siblings or MZ-discordant twins or parent-offspring trios within the same family — controls for between-family confounds (population stratification, AM, passive rGE).
Genetic nurture	Effect of parents’ genotype on offspring outcomes via the environment the parents create — including alleles the parent did not transmit.

10. Stage_outputs convention reference

Raw working drafts from each LLM-iterate stage live at:

stage_outputs/<topic>/<stage>.md

Where <topic> is kebab-case (e.g., human-psych-variation) and <stage> is one of: lit-review, topology, model, data, build. Polished versions move into src/content/ai_research/<topic>/<stage>.mdx with proper frontmatter (title, description, date, status, refinementPass, refinementLog) once ready to publish on the site.

The interactive D3 graph for this topology lives at src/components/research/PsychVariationGraph.tsx and is mounted in src/content/ai_research/human-psych-variation/topology.mdx via client:load.

Iteration history

Pass 3 2026-04-28

error check (citations)crux re-classificationredundancy pruneconnectionsnext moves

Why Pass-2 introduced two attribution errors (Figlio for Ghirardi; Wang for Wainschtein) that I caught with verification searches. Separately, calling A1/A2/A3 plus G2/G6 plus L4 all "cruxes" conflated three different types of high-stakes nodes — the most common conceptual error in pop-science treatments of this field. The "Load-bearing vs decorative" section was redundant with the node-catalog weights. And the document lacked a clear "choose your Stage 3 path" deliverable.
- Verified all pass-2 citations via WebFetch on PubMed
- Fixed Figlio et al. 2024 → Ghirardi et al. 2024 (lead author Gaia Ghirardi, Res Soc Strat Mobil) and corrected Norwegian → Netherlands Twin Register
- Fixed Wang et al. 2025 → Wainschtein et al. 2025 (Nature) — same Wainschtein who did the 2022 height paper, expanded to UK Biobank scale
- Restructured §3 from "6 cruxes" into three explicit categories: §3a Foundational cruxes (A1, A2, A3 — falsification breaks regions), §3b Reframer nodes (G2/E6, G6/E7 — open magnitude reshapes interpretation), §3c Logical guardrails (L1, L4 — unfalsifiable but load-bearing for interpretation), §3d Decorative material
- Dropped old §4 (Load-bearing vs decorative) — info already in node-catalog weights; decorative list moved into §3d
- Renumbered §5→§4 (weakest links), §6→§5 (variants), §7→§6 (handoff)
- Updated §5 Variant A vulnerability map to reference the new 3a/3b/3c/4 structure instead of the old "six cruxes + seven weakest links" framing
- Rewrote TLDR para 2 to introduce the foundational / reframer / guardrail trichotomy explicitly (the conceptual move that this pass installs)
- Added §6 connection note pointing to Parent-to-Child Transmission and Evolution-Modernity Mismatch as natural downstream consumers of this topology
- Added §7 Next moves — three discrete Stage-3 options (variance decomposition + method gradient / multivariate sex-difference algebra / PGS-portability calibration) with pros/cons/Stage-4 implications and an explicit recommendation
- Updated PGS-portability item in §6 from "Martin 2019 4.5× drop" to "Ding 2023 continuous decay"
- Component: synced Ghirardi and Wainschtein attributions in node detail strings
Pass 2 2026-04-28

gap scan with web research

Why Pass-1 deferred web research because the parent harness initially seemed to block WebSearch. After user pushback I confirmed WebSearch was available, ran 12 targeted searches across the topology weakest links and open questions, and incorporated the 2024-2025 findings that actually shift the picture.
- M9 (WGS): missing-heritability problem now substantially resolved (~88% of pedigree h² in 2025 paper)
- E23 (PGS portability): reframed as continuous decay along genetic-distance continuum (Ding 2023, r=−0.95)
- E25 (Scarr-Rowe): downgraded weight 3→2, status ~→✗; compensatory hypothesis is the better-supported pattern
- M4 (within-family designs): strengthened with Howe 2022 178k siblings × 25 phenotypes
- E14 (GEP): pattern strengthened by 2025 systematic review across multiple domains
- O7 (AM-correction magnitude): LAVA-Knock method now emerging
- G2 (passive rGE): extended to dynastic / extended-family processes (Nivard 2024)
- O2 (Flynn reversal): positive-manifold-weakening wrinkle (Pietschnig 2024)
- TLDR weakest-links paragraph rewritten to reflect 2026 state
Pass 1 2026-04-28

error checkgap scanadversarial + steelmanreadability

Why Pass-0 had a logical cycle (A3 ↔ E18), an unsupported quantitative claim, two near-duplicate nodes, one mistyped edge, and no defense against the strongest objections. Drag was also broken on the interactive graph.
- Fixed drag bug (pointer events with setPointerCapture)
- Removed bidirectional A3 ↔ E18 cycle
- Reframed E5 → E4 from "implies polygenic" to "consistent with polygenic"
- Reframed L1 → E1 edge type from mod to imp with explicit label
- Removed redundant A4 → L1 edge
- Softened TLDR claim from "~70% of inferential weight" to "sit upstream of most empirical and synthesis nodes"
- Added §8 Objections to this topology (4 objections + steelman + response)
- Added §9 Glossary (19 terms)