Diophantine Astrophysics arithmodynamics · a candidate field at the boundary of number theory and celestial mechanics

Which orbital architectures observed in the universe are forced by number-theoretic constraints, rather than selected by initial conditions or dynamics?

draft · day 2 · 2026-05-20 data: NASA Exoplanet Archive 5933 planets · 1037 multi-planet systems

Contents
  1. Why this is a real gap
  2. The conjectures (with one major reformulation)
  3. Test 1 — partial-quotient distribution vs Gauss-Kuzmin
  4. Test 2 — dynamical-age stratification
  5. Test 3 — RV-only control for selection bias
  6. Test 4 — Conjecture II at Farey-9
  7. Test 5 — Khinchin and Lévy constants flips Conj I
  8. Test 6 — Conjecture II with larger alphabets and Markov-chain test
  9. Test 7 — Mixture fit and pile-up structure quantifies Conj I″
  10. Synthesis and what to do next
  11. Reproduction

1. Why this is a real gap

Three established fields each see one face of the same elephant and don't talk to each other.

The bridge: the observed period-ratio distribution is the pushforward of the cosmic initial-condition measure through some sequence of number-theoretic filters. Day-1 assumed the filter was KAM stability (favouring noble irrationals). Day-2 falsifies that and identifies the dominant filter as resonance trapping during migration (favouring low-order rationals).

↑ back to top

2. The conjectures

Conjecture I (original) — Khinchin Selection falsified, see Test 5

Long-lived multi-planet systems concentrate on period ratios whose continued-fraction expansion has bounded partial quotients — i.e. more noble, more KAM-stable irrationals.

The per-ratio Khinchin and Lévy constants both shift in the opposite direction at z ≈ +10 (Test 5).

Conjecture I″ — Resonance Excess strongly supported

Observed period ratios show systematically larger partial-quotient geometric mean (Khinchin) and faster-growing convergent denominators (Lévy) than the Gauss-Kuzmin null. The 1-D marginal density of {r} = r − ⌊r⌋ deviates from the Gauss measure at χ² = 111 on 19 dof (Test 7), with localizable pile-ups at 3:2 and just-below-integer (Lithwick–Wu). The dominant filter on period ratios is migration-driven resonance trapping, not KAM-stability suppression of resonant tori.

Conjecture II — Resonance Lattice Rigidity supported across Farey-9, 12, 15

For an n-planet system, the resonance chain — the tuple of nearest low-order commensurabilities — concentrates on a sparse subset of the combinatorially possible chains. Concentration is robust to the size of the alphabet (Tests 4 and 6). Consecutive resonances in a chain are non-independent at coarse resolution.

Conjecture III — Arithmetic Hair not yet tested

Galactic disk substructure (moving groups, phase-space spirals) is the image, under the galactic potential's integrable flow, of rational points on a lower-dimensional resonant submanifold of action space.

↑ back to top

3. Test 1 — partial-quotient distribution vs Gauss-Kuzmin

Take all 1037 multi-planet systems from the NASA Exoplanet Archive, sort each by period, compute adjacent ratios r = Pi+1/Pi, and for each r compute the continued-fraction expansion to depth 8. Null: Gauss-Kuzmin, P(ak = m) = log2(1 + 1/(m(m+2))).

akempiricalGauss-Kuzminratio
10.4300.4151.04
20.1670.1700.98
30.0880.0930.94
40.0560.0590.95
50.0400.0410.99
60.0280.0300.95
70.0190.0230.83
80.0170.0180.92
90.0150.0141.02
≥ 100.1400.1371.02

χ² = 21.5 on 9 dof. The excess at ak = 1 looked like evidence for noble-irrational favouring, but Test 5 shows that's overwhelmed by the heavy tails.

Empirical vs Gauss-Kuzmin distribution
Empirical (red) vs Gauss-Kuzmin (dark) over 10924 ak samples.
↑ back to top

4. Test 2 — dynamical-age stratification

Norbits = stellar_age × 365.25·10⁹ / Pinner_days spans log10 5.8 to 13.0 across 689 ratio-pairs with measured age.

quartilemedian log10 Norbitsfrac(ak=1)frac(ak≥4)
Q110.430.4300.328
Q211.310.4280.299
Q311.640.4370.315
Q412.040.4340.321
Quartile trend in dynamical age
No monotonic trend across 1.6 decades of dynamical age.
↑ back to top

5. Test 3 — RV-only control for selection bias

samplen systemsn akfrac(a=1)χ² / dof
all multi1037109240.43121.5 / 9
RV-only20820720.4357.4 / 9
Transit-only68372140.43414.9 / 9
RV vs Transit vs Gauss-Kuzmin
All three samples deviate from Gauss-Kuzmin with the same per-sample magnitude. Selection bias is not the explanation.
↑ back to top

6. Test 4 — Conjecture II at Farey-9

Resonance alphabet Σ = {(p,q) : gcd=1, p > q ≥ 1, p+q ≤ 9}, |Σ| = 13. Each adjacent ratio gets the nearest symbol.

Ln systemsdistinct observeduniform expectation
2 (3-planet)21979~123
3 (4-planet)8170~79
3-planet systems in the period-ratio plane
Three-planet systems in the (P2/P1, P3/P2) plane. Grey lines = low-order Farey ratios; Trappist-1 marked in red.
↑ back to top

7. Test 5 — Khinchin and Lévy constants flips Conjecture I

Two universal constants govern continued-fraction expansions for almost every real (Lebesgue):

Both are smaller for noble-ish irrationals (KAM-favoured). Conjecture I predicted period ratios shift downward.

quantitytheoreticalempiricalSEz
⟨ln ak0.9881.1940.020+10.09
(1/n) log qn1.1871.4280.022+10.95
K = exp ⟨ln a⟩2.6853.301
Khinchin constant histogram
Per-ratio Khinchin mean over 1564 adjacent pairs. The empirical centre is decisively to the right of K₀.
Levy constant histogram
Per-ratio Lévy slope. Empirical mean > L₀; the golden ratio (ln φ) is far to the left of the bulk.
The flip: period ratios are closer to low-order rationals than random, not further. Both Khinchin and Lévy at z ≳ 10 on the opposite side of the Gauss-Kuzmin null.

Bonus: conditional dependence of consecutive partial quotients

Under Gauss-Kuzmin the (ak) are asymptotically i.i.d. Contingency table on buckets {1, 2, 3, 4, ≥5}:

ak \ ak+1=1=2=3=4≥5
=119338825193311681
=2850352172125558
=35191909366235
=43481155745154
≥51520526309172722

χ² = 173.2 on 16 dof (critical@p=0.01 ≈ 32).

Conditional dependence heatmap
(obs−exp)/√exp from independence. (1,1) and (≥5,≥5) cells deviate negatively; (1,≥5) and (≥5,1) deviate positively — small/large alternation is favoured over repetition.
↑ back to top

8. Test 6 — Conjecture II with larger alphabets and Markov-chain test

alphabet|Σ|L=2 distinctuniform exp.entropy (bits)uniform entropy
Farey-913791235.907.40
Farey-12221311766.737.78
Farey-15351662017.187.78

Top L=2 chain types, Farey-12

chaincount
2:1 — 2:114
2:1 — 7:46
2:1 — 11:15
2:1 — 7:34
5:3 — 7:44
7:3 — 7:34

Markov-chain independence test

alphabetdofχ² rawχ² mergedverdict
Farey-9144 / 100259.9142.0significant
Farey-12441 / 196493.4210.6marginal
Farey-151156 / 256864.6247.8underpowered
Markov-chain heatmap
Deviation from independence in (s₁, s₂) for Farey-12 chain symbols (top-12 shown). Diagonal excesses dominate — consecutive identical resonances cluster.
↑ back to top

9. Test 7 — Mixture fit and pile-up structure quantifies Conj I″

Take the fractional part {r} = r − ⌊r⌋ ∈ [0, 1) of every adjacent ratio (n = 1564). This pools over integer parts and gives a single 1-D density to test.

Non-parametric chi-square (20 equal bins)

baselineχ²dofcritical @ p=0.001
Gauss measure 1/((1+x) ln 2)111.21943.8
uniform on [0,1)50.51943.8

Both nulls are decisively rejected. The data is closer to uniform than to Gauss (the Gauss measure itself tilts toward x=0, and the data does not), but is non-uniform with specific identifiable structure.

Top bin-by-bin deviations from Gauss

{r} windowobservedGauss expectedzphysical interpretation
[0.50, 0.55)11474.0+4.653:2 pile-up
[0.10, 0.15)63100.3−3.72gap just past integer (Lithwick–Wu)
[0.90, 0.95)8858.6+3.84pile-up just below integer (Lithwick–Wu)
[0.05, 0.10)71105.0−3.32same gap, inner edge
[0.85, 0.90)8360.2+2.94broad 2:1 pile-up shoulder

The pile-up/gap pattern at {r} ≈ 0.90–0.95 (excess) and {r} ≈ 0.05–0.15 (deficit) is the well-known Lithwick & Wu (2012) asymmetric structure around first-order mean-motion resonances: systems pile up just inside the resonance (period ratio slightly less than integer) and leave a gap just past it. Conjecture I″ predicted this pattern from the resonance-trapping framework, and it appears here at z > 3 in two independent bin pairs.

Narrow-window pile-up at ±2% of each Farey-9 center

center {p/q}labelobservedGauss expectedratioz
0.000integer (2:1, 3:1, …)4467.40.65−2.85
0.2505:45572.20.76−2.03
0.3334:37167.71.05+0.40
0.5003:28460.21.40+3.07
0.6675:36654.21.22+1.61

The narrow ±2% windows confirm: only 3:2 shows a sharp pile-up at the exact resonance. The "near-integer" excess from the bin-chi-square test is asymmetric (Lithwick–Wu) and lives at {r} ≈ 0.90–0.95, not at {r} = 0 itself — which actually shows a slight deficit (z = −2.85) at the ±2% scale.

Narrow-window pile-up bar chart
Observed vs Gauss-expected mass within ±2% of each Farey-9 center. Only the 3:2 pile-up is sharp at the exact-resonance scale; the 2:1 / 3:1 / … pile-ups are broad and offset (see bin table above).

Mixture-model fit: α·Gauss + (1−α)·Farey

We model the empirical density as a mixture of the Gauss measure and a KDE-style mass concentrated at Farey-N centers with bandwidth σ:

p({r}) = α · 1/((1+{r}) ln 2) + (1−α) · (1/Z) ∑_{(p,q) ∈ Σ_N} w_{p,q} · K({r}; {p/q}, σ)

MLE over a 51×41 (α, σ) grid:

Farey alphabet|Σ|ασΔLL vs pure Gauss2·ΔLL
Farey-950.020.500*+28.557.1
Farey-1280.020.500*+28.657.2
Farey-15120.020.500*+28.657.3
Farey-20220.000.500*+28.757.3

*σ hits the upper grid boundary at 0.5 in every run. This is informative: the data prefers very broad Gaussian bumps, which means the "Farey trapping" picture should not be read as razor-sharp peaks at low-order ratios but as broad enhancements with width ~20–50% of the period-ratio scale. Sharp-peak mixture models are non-identifiable here.

Mixture density vs data
Empirical histogram of {r} (grey), pure Gauss measure (dark), Farey-12 component at MLE σ (red), and the best mixture (green). Vertical lines mark Farey-12 centers, with line width proportional to weight.
Profile likelihood for α
Profile likelihood for α with σ fixed at the MLE (Farey-12). The MLE favours α ≈ 0; 95% CI extends only to about 0.27. Most of the {r} measure is captured by Farey neighborhoods, not by the Gauss component.
Alphabet ablation
Farey fraction (1 − α) and ΔLL vs pure Gauss across Farey-N alphabets. The Farey weight monotonically increases as the alphabet grows (the model approaches a KDE in the limit).
What the fit tells us, cleanly:
  1. The {r} distribution is decisively non-Gauss (χ² = 111 on 19 dof).
  2. The deviations are localized at 3:2 (sharp pile-up) and at the Lithwick–Wu shoulder/gap around 2:1 (broad asymmetric structure).
  3. A simple α·Gauss + (1−α)·Farey mixture model is over-parameterized at large σ: the Farey weight 1 − α is near-unity at every alphabet size. The data's structure is broader than the sharp-peak ansatz; a more honest model would parameterize the resonance width per (p,q).
↑ back to top

10. Synthesis and what to do next

Open problems (ordered by accessibility)

  1. Build a physical mixture model: instead of constant σ, give each (p,q) its own width σp,q ∝ μ2/3/(pq) (the resonance libration scale for mass ratio μ). The two-parameter fit becomes a one-physical-parameter fit (μ-distribution + a trapping-efficiency function).
  2. Decompose the Lithwick–Wu asymmetry: fit the "pile-up − gap" signature near each first-order resonance (4:3, 3:2, 2:1, 5:4) separately and check whether the asymmetry magnitude scales as predicted by capture-from-disk theory.
  3. Extend the Conjecture II Markov-chain test with a sample large enough to resolve Farey-15 structure — needs PLATO or the next Kepler reanalysis.
  4. Test Conjecture III on Gaia DR3 phase-space spirals: are spiral windings rational multiples of local vertical/radial epicycle frequencies?
↑ back to top

11. Reproduction

# Pull data
curl -sG "https://exoplanetarchive.ipac.caltech.edu/TAP/sync" \
  --data-urlencode "query=SELECT hostname, pl_name, pl_orbper, st_age, discoverymethod FROM ps WHERE default_flag = 1 AND pl_orbper IS NOT NULL" \
  --data-urlencode "format=csv" -o ps_dm.csv

# Day-1 tests
python3 analyze.py
python3 analyze_dynamical.py
python3 analyze_rv_and_c2.py

# Day-2 tests
python3 analyze_conjecture_I_deep.py
python3 analyze_conjecture_II_extended.py
python3 analyze_mixture_v3.py

Dependencies: numpy, matplotlib.

↑ back to top