Ring Necessity

We test whether modular arithmetic is necessary, or whether the PRINCIPLES (channel decomposition, Pareto sizing) work without it. Four architectures, same abstract task: context-dependent function selection with per-channel evolution.

The experiment: replace modular multiplication (a*b mod q) with clamped multiplication (min(a*b, q-1)). This removes the ring's cyclic wrapping while keeping the channel structure. If clamp matches ring, the ring is scaffolding. If not, the ring is essential.

Four Architectures

A: RING
[8,9,25,49,11,13,17] mod
Full CRT ring. Modular multiply. The standard.
B: CLAMP
[8,9,25,49,11,13,17] saturate
Same sizes, clamped multiply. No ring. Just the shape.
C: FLAT
[19,19,19,19,19,19,19] saturate
All channels same size 19. No Pareto.
D: MONO
[997] saturate
One big channel. No decomposition.

Results

4 functions, 160 training examples, 200 generations of per-channel evolutionary search. Same task structure across all configs.

A RING
ch 762 (68%), exact 47/160
Starts at 21%. Improves +523. Genuine learning.
B CLAMP
ch 1038 (92%), exact 89/160
Starts at 63%. Improves +337. High but inflated.
C FLAT
ch 1085 (96%), exact 155/160
Starts at 83%. Improves +156. Nearly trivial.
D MONO
ch 160 (100%), exact 160/160
Starts at 100%. Improves +0. Fully degenerate.

The Finding: Information Preservation

Clamp BEATS ring on raw numbers. But the scores are artifacts of degeneracy. Clamped multiplication saturates to q-1 for most inputs when weights are large. The monolithic config gets 100% trivially -- every output is 996.

The improvement metric reveals the truth. Ring: +523 points of genuine learning from a hard start (21%). Clamp: +337 from an inflated baseline (63%). Flat: +156 from near-trivial (83%). Mono: +0 from perfectly trivial (100%). Ring creates a task where the model must genuinely learn. Clamp creates a task where the answer is trivial.

Why? Modular multiplication is BIJECTIVE (for coprime weight). Every input maps to a unique output. Wrong weights produce completely wrong outputs -- strong evolutionary signal. Clamped multiplication is LOSSY: many inputs map to q-1. Wrong weights still produce q-1 for most inputs -- weak signal, trivial matching.

The ring is necessary not as algebraic scaffolding but as an information-preserving arithmetic substrate. The value of modular arithmetic is bijectivity: it preserves information through cyclic wrapping.

Context: Three Experiments

Experiment 1
Constraints don't help
Inter-channel constraints (3^2-2=7, etc.) add no prediction advantage. Enforcing awareness hurts by -126 ch, -79 exact.
Experiment 2
Physics = coincidence
Any 7 primes match physics targets (dense expression space). Template-specific p=0.002 but post-hoc.
Experiment 3
Ring = preservation
Ring is necessary for non-degenerate computation. Its value is bijectivity, not algebraic constraints.
Verdict
Technique + substrate
CRT = universal technique. Ring = necessary (bijectivity). Constraints = decoration. Physics = coincidence. ~2000 theorems = real algebra.

Source code · Public domain (CC0)

Report issue

.ax source compiled to WASM via self-hosting compiler. Zero HTML authored.