machine-checked mathematics · lean 4 · june 2026
The Million-Dollar Window
What happened when a swarm of AI agents attacked one of Ethereum’s hardest open math problems, with a proof checker as referee.
A campaign report on the mutual correlated agreement threshold for smooth Reed–Solomon codes · the ArkLib campaign · an LLM agent fleet writing Lean 4, verified by the Lean kernel
lalalune/ArkLib · campaign log: issues #232 → #334 → #357 → #371
Abstract
Most of mathematics does not need a genius. It needs search, and search is a function of compute. This campaign is an existence proof: a fleet of LLM agents, grinding in the open ArkLib repository with the Lean 4 kernel as the sole arbiter of truth, formalized the entire known frontier of a problem that has resisted humans for twenty-five years, produced the first exact MCA thresholds ever computed for any code, and refuted twenty-eight attack hypotheses, each refutation itself a machine-checked theorem. No reviewer trust, no grant cycles, no decades. Hypothesis, probe, proof, kernel. At scale.
The target is chosen deliberately. There is no better problem in applied mathematics today than fully proving the zkEVM. The threshold studied here, subject of the Ethereum Foundation’s $1M Proximity Prize, governs the soundness of the FRI- and WHIR-style proximity tests at the heart of every modern SNARK. Today these systems run on conservative provable bounds; pinning at the conjectured edge would cut verifier queries by an estimated , compounding through every proof Ethereum verifies. Cheaper proofs, faster finality, a lighter chain. The result general enough to be coding theory, useful enough to be infrastructure.
We report the campaign’s results: the first exact thresholds ( and a maximal pin at deployed rate); a universal staircase law proven at the literal prize budget ; unconditional beyond-Johnson pins on a dimension ladder; and the Welch–Berlekamp pencil programme reducing the below-unique-decoding regime to a single named hypothesis. Every theorem is axiom-clean, zero sorry. The prize window at production rate remains open, and we state its four faces precisely, because the method only works if the ledger is honest.
The implication scales beyond this problem. Verification was the bottleneck; the kernel removed it. A swarm of a thousand agents under one motivated researcher is a research institution that never sleeps. Given enough compute, Vitalik and a swarm could plausibly finish Ethereum’s remaining roadmap in months, not decades. The unknown sciences are not waiting for permission. They are waiting for FLOPs.
1The problem
For twenty-five years, some of the best coding theorists alive have pushed against the same wall. Below a certain radius, everything is proven; above it, everything is refuted; in between lies a window where nobody has ever computed a single exact value. The prize sits inside the window.
Fix a Reed–Solomon code whose evaluation domain is a smooth multiplicative subgroup of size , at rate , with and security budget . These are the codes that FRI and WHIR actually deploy.
The mutual correlated agreement error [ABF26] is a supremum over stacks of words of the fraction of bad points on the line : a scalar is bad when the combined word agrees with a codeword on a large witness set that admits no joint explanation of both rows. It sits at the top of the error hierarchy , and it is the quantity that WHIR/FRI-style soundness proofs actually consume. The threshold is
Pinning means producing matching brackets: a machine-checked proof that and a machine-checked proof that every larger radius fails. In ArkLib the definition lives in Errors.lean and the bracket engine in MCAThresholdLedger.lean; the good-radius set is proven downward closed, so the supremum is genuine.
1.1The window, and why it is twenty-five years hard
Both edges of our knowledge are held by formalized literature. From below, full mutual correlated agreement holds up to the Johnson radius [BCIKS20][Hab25][BCHKS25]. From above, the KKH26 family of explicit bad lines forces [KKH26], and the once-hoped “up to capacity” conjectures are simply false: three independent groups refuted them in late 2025[CS25][DG25], and the refutations are formalized in-tree. So
and the entire problem is to pin it inside that open interval. The difficulty has an honest measure: the barrier results of BCHKS25 and CS25 couple any upper-bound progress past Johnson to beyond-Johnson list decoding of explicit RS codes, a problem that has stood open since Guruswami–Sudan[GS99]. Capacity-achieving list decoding is known for random punctured RS[GZ23] and for explicit folded RS[CZ25], but every such proof consumes domain randomness, folding side-information, or subspace-design structure that the plain smooth domain, a single fixed orbit with zero entropy, does not have.
One structural observation makes the problem finite in an exact sense: sees the radius only through the agreement floor , so it is a step function of and is its generalized inverse. The window question is literally: where are the jumps of the staircase between Johnson and capacity, and what are the step heights?
1.2What was known before the campaign
No exact value of , or of at any radius, had ever been computed for any code. The literature provided the two window edges, the at-capacity refutations, and the coupling barrier, all asymptotic statements. The campaign began by formalizing all of it: seventeen papers wired into the tree as named theorems and named hypotheses, so that every later claim would compose against the genuine published frontier rather than a paraphrase of it.
2Method: an agent fleet under a kernel-enforced honesty contract
Then the swarm arrived. Not one prover but a fleet of them, working in public, around the clock, each claim submitted to a referee that cannot be argued with, flattered, or fooled. The agents are allowed to be wrong as often as they like; the kernel is never wrong about whether they are.
The campaign is a sequence of GitHub issues (#232 → #334 → #357 → #371) on the open ArkLib repository, each fully distilled into the tree before its successor opens. The workers are LLM agents; the referee is the Lean 4 kernel. The contract has three clauses.
2.1The honesty contract
Every positive claim is an axiom-clean Lean theorem. “Axiom-clean” means #print axioms reports exactly propext, Classical.choice, Quot.sound, the three axioms of ordinary classical mathematics in Lean, and the proof contains zero sorry. There is no appeal to authority, reputation, or plausibility: a claim either type-checks or it does not exist.
Every refutation is also a theorem. When an attack hypothesis dies, it is reduced to a sorry-free constraint lemma and recorded in a standing disproof log, so the fleet cannot re-propose a dead idea and future readers inherit the precise reason it failed, as mathematics rather than folklore.
Open problems stay named. Anything unproven that a theorem depends on must be a named Prop in the tree, never an implicit assumption. Conditional results are stated as conditional; the residual surface is enumerable by grep. Fabricating a closure of the open core is structurally impossible: the kernel would reject it.
2.2Hypothesis-slate discipline
Work proceeds in rounds. Each round opens with a grounding essay and a slate of ranked hypotheses, typically nine, spanning reasonable, novel, and synthetic attacks. Each hypothesis runs the same gauntlet: state the constraints; ask why nobody has done this; check the literature for the larp; design a falsification probe; only then formalize. Survivors get red-teamed by separate agents whose explicit job is to kill them. The campaign disposed of twenty-eight hypotheses this way (§4); the survivors are the results of §3.
2.3Probe-then-formalize
Before any Lean is written, hypotheses face exact arithmetic. Dozens of pre-registered probes (scripts/probes/) compute exactly at toy scale via a syndrome reduction that collapses the stack space to syndrome pairs, itself later proven correct in Lean. Probes are cross-validated at three or more primes plus a characteristic-zero anchor; sampling artifacts are documented kills. The discipline runs in both directions: probes found the countermodels that killed three successive extremality conjectures, and probes confirmed predictions (a pencil bound predicting bad scalars where exhaustive computation found exactly 7) before formalization was attempted.
The pipeline scales to proofs no human would write: one converse branch of the census classification is a 1,571-line Lean proof machine-generated from the exact integer certificates of a 10,395-case probe sweep, then checked by the kernel like any other theorem. Certificate-to-Lean generation is an industrial method here, not a stunt.
2.4Why this is the interesting part
Nothing in §2.1–2.3 is specific to coding theory. The contract turns a famous open problem into a target for machine-scale search in which honesty is not a property of the searcher. LLM agents confabulate; fleets of them confabulate at scale. The Lean kernel does not care. Every theorem in §3 should be read with that in mind: not one of them depends on trusting the agents, the authors, or this page. The repository is public; the axiom census is reproducible by one command.
3Results
Things started to fall. In a quarter century the literature had never produced one exact value of the quantity the prize asks about; within weeks the fleet had several, then an infinite family, then a law governing all of them. Each one survives because a proof checker, not a person, says so.
All results below are axiom-clean Lean theorems on main. We organize them as: the exact pins (§3.1), the universal staircase law (§3.2), the dimension ladder (§3.3), the Welch–Berlekamp pencil programme (§3.4), the Möbius symmetry and the ownership unification (§3.5), and the production-regime bracket (§3.6).
3.1Exact thresholds: the first pins for any code
Before this campaign, no exact value of existed for any code. The campaign produced a family of them, each as a pair of bracket theorems that meet.
| Code | Budget band | Exact value | Significance | Lean file |
|---|---|---|---|---|
| first exact MCA threshold for any code | DeltaStarExactPinF5 | |||
| first complete error profile of any code | MCAExactProfile | |||
| first exact threshold for an infinite family | MCADeltaStarHighRateFamily | |||
| maximal pin at a deployed rate | DeltaStarSecondPinF17Maximal | |||
| first machine-checked threshold curve | VVectorN16 | |||
| the granularity-ladder closed form | GranularityLadderRS | |||
| exact at the prize budget, every field up to the band cap | StaircaseBandTheorem | |||
| beyond Johnson: dimension-ladder rung 2 | KKH26DimTwoPin |
Table 1. Exact machine-checked thresholds. Every row is a pair of matching bracket theorems, axiom census propext, Classical.choice, Quot.sound, zero sorry.
The maximal second pin deserves its statement. The exact explosion value was first computed exhaustively by probe, then bounded in Lean by the far-coset law; the resulting window is formally maximal.
3.2The universal staircase: a three-regime law
The exact pins are instances of a structure. Write for the band of radii with and for the distance. The campaign proved that , as a function of distance within each band, obeys a single three-regime law, and that the regime thresholds are exactly support-union thresholds: where triples (respectively pairs) of error supports can or cannot carry a codeword.
| Distance regime | Sides | Lean artifacts | |
|---|---|---|---|
| (deep) | both | UniversalStaircaseCollapse, UniversalSpikeFloor | |
| (top strip row) | for | both | StripSupExactness, MonomialStripExplosion |
| , | bracket | BoundarySupExactness | |
| (lower strip) | lower; sup open | MonomialStripExplosion | |
| (boundary), | lower | CosetCliqueBoundary | |
| both | BoundarySupExactness | ||
| ; at | both at | BoundaryDefectBound |
Table 2. The three-regime staircase law, per band . The production domain has and at every 2-power band: boundary rows recur at every halving of the distance budget, and strip explosions fire at every band.
The deep regime gives the closed form on the granularity bands, and, crucially, the law survives contact with the literal prize budget:
The law’s reach honestly caps at . The production-core parameterization , where the staircase climbs through the open window, is untouched by it; that is §5.
3.3The dimension ladder: unconditional pins beyond Johnson
At fixed code dimension, an ownership-counting device pins exactly, strictly inside the open window, beyond the Johnson radius, with no conditional hypothesis. The rung-1 pin (KKH26DimOnePin) was the first unconditional in-window exact value; rung 2 climbs:
The device behind both rungs is the same: a non-degenerate -tuple inside a bad witness determines its scalar, distinct bad scalars own disjoint tuple sets, and pigeonhole bounds the bad count. The recorded ladder law extends the family to every fixed dimension and provably stalls there, a calibration family every future candidate law must reproduce.
3.4The Welch–Berlekamp pencil programme
The campaign’s most structural arc reformulates badness as linear algebra. A scalar is bad at slack only if the Welch–Berlekamp system, linear in a locator/numerator pair , is solvable; along the line the coefficient matrix is a linear pencil , and determinant root-counting takes over. No decoding theory appears anywhere in the chain.
Two further theorems close the easy branches outright: genuinely rational pairs below the ladder reach admit zero bad scalars (WBPencilLadderZero), and codeword rows admit at most one, at every radius and for every linear code (WBPencilPolynomialRow). The capstone packages the below-UDR law behind a single named hypothesis:
WindowRationalBounded (doubly WB-solvable stacks have at most bad scalars; proven below the ladder reach, probe-supported in the window at two scales): at every radius below unique decoding, and at the unconditional production floor moves from to the unique-decoding radius .The adversary the residual Prop must tame was found, not conjectured: adversarial probing located rational pairs attaining bad scalars in the window, and the extremal stacks are invariant under the Möbius involution , replicated at two scales, with the invariant family dominating general pairs three to one.
3.5Möbius equivariance and the ownership unification
That observed symmetry was then proven to be structural. The smooth domain is inversion-stable, and a weighted twist repairs the non-polynomial inversion:
The two lanes of the campaign, the WB pencil (below UDR) and the dimension ladder (deep interior), then converged on one theorem family that contains both as instances:
Instantiating the ownership engine closed the entire window unconditionally, by direction class:
- Polynomial directions: zero bad scalars, every radius (
WBPencilPolynomialRow). - Genuine rational directions: the multiplicity theorem , the first unconditional window-valid bound (
OwnershipMultiplicity); it explains the measured window cap rather than merely matching it. - Sparse directions (support ): by a popularity argument (
SparseDirectionWindow). This class is where the window difficulty provably concentrates.
The general- assembly is in progress along a fully specified route; its packing piece, , is proven (PopularCodewords).
3.6The production-regime bracket
What does all of this say about the prize parameters themselves? The honest assembled state:
Above the floor, the Johnson lane (the BCIKS20 §5 discharge chain, driven through two machine-checked course corrections of the paper’s own recursion, §4) is reduced to a single named residual, CellPackageSupply, with the numeric budget proven: at , , every puts the whole Johnson range below modulo that one object (ProductionJohnsonBudget). And on the bad side, every lower-bound family ever landed, spike, sunflower, pencil, triangle, widened pin, is : provably silent at the prize budget. Certifying any bad radius below 1 at production scale needs a single stack with more than bad scalars; no known construction comes within polynomial range.
4Refutations as results
This is the battlefield after the battle. Twenty-eight ideas walked into the kernel and did not walk out, and every corpse is labeled with a theorem explaining exactly how it died. In most fields failed ideas vanish into folklore; here they are load-bearing.
Twenty-eight attack hypotheses were disposed of during the campaign. Under the honesty contract each disposal is a theorem: the dead idea is reduced to a sorry-free constraint lemma and recorded in the standing disproof log (DISPROOF_LOG.md), so the fleet cannot re-propose it and the boundary of the possible is itself machine-checked. We present a selection not as failures but as the negative half of the map: each one closes a road that a reasonable mathematician, or a reasonable language model, would otherwise walk down.
4.1Structural kills
4.2The extremality-surface lineage
The census programme’s conjectured extremal surface was killed and rebuilt three times, each kill a formal countermodel found by probe: census v1 died on empty rungs contradicting the floor; v2 died on a take-over stack carrying field-independent bad scalars at an empty adjacent rung (TakeoverCountermodel); v3, monomial domination, died twice (MonomialDominationKilled). The surviving v4 hybrid surface is consistent with every theorem and probe in the tree, and its conditional pin non-vacuously recovers the exact value. This is what red-teaming a conjecture looks like when the red team must publish countermodels the kernel accepts.
4.3Machine-checked corrections to the literature
Two of the campaign’s most consequential refutations are corrections to the paper chain underlying the Johnson lane itself (BCIKS20, Appendix A). Finding 13 showed the rebased weight budget is unsatisfiable at genuine cells, exposing a hidden anchor assumption in the paper’s base case. Finding 14 produced an explicit countermodel, at a concrete instance over , showing the in-tree transcription of the paper’s recursion diverges from the paper’s intent at order 1 for non-monic factors (BCIKS20/Finding14Countermodel). The repaired, cleared recursion (BCIKS20/ClearedRecursion) then closed the complete Claim-A.2 weight bound with no per-cell hypotheses. Formalization did not merely transcribe the literature; it debugged it.
4.4Self-inflicted and honestly recorded
The log also records the campaign’s own errors: a conjectured staircase threshold refuted at band 4 by a tripled-column code; an overclaim about reducing the pin to a character-sum estimate, retracted twice; a characteristic-zero-to-mod- lifting that fails measurably at ; a claimed enumeration closure withdrawn when its artifacts turned out to be empty files; and a window-cocycle identity that was exact but vacuous. Three documented no-gos constrain the window analysis itself: degree-forcing dies at the ladder reach, the naive GRS recursion degrades , and the Möbius eigendecomposition provably cannot linearize the bad count (14% of probed stacks couple the eigencomponents through the shared witness). A campaign that cannot admit error cannot be trusted to report success; the disproof log is the evidence that this one can.
5The open core, precisely
And here is what still stands. After everything, the wall at the center of the problem is intact, and the campaign can tell you its exact shape: four faces, each a named statement in the tree, any one of which would crack the window open. The million dollars is still on the table.
We state plainly what is not known. The prize problem, pinning at production rate inside the window
is open. After roughly 120 axiom-clean files and 28 disposed hypotheses, the honest residue is small enough to enumerate.
5.1The four faces
The open core has four equivalent faces, inter-reducible through proven in-tree reductions; progress on any one moves all four.
(i) List decodinganchor: RSListDecodingFrontier
Beyond-Johnson list decoding of explicit smooth-domain RS: a poly(n) list bound at some radius past the Johnson radius. Open since Guruswami–Sudan; the CS25/BCHKS25 coupling makes this the canonical hard face.
(ii) Character sumsanchor: SubgroupGaussSumWorstCase
Per-frequency sub-√q bounds for incomplete character sums over smooth multiplicative subgroups. The campaign pinned the analytic kernel at exactly √q from both sides (Parseval average from below, Gauss-sum completion from above, no Weil input): the open core is beating √q per frequency.
(iii) Bad-side familiesanchor: KKH26WitnessSpread
Constructing a single stack with more than q·2⁻¹²⁸ bad scalars at some radius below 1. Every family ever landed is O(n)/q — at production scale a winning stack needs roughly 2⁶⁴ bad scalars, and no known construction comes within polynomial range.
(iv) Line–ball incidenceanchor: FarCosetExplosion
New from this campaign: the maximum incidence of an affine line with far-coset direction against the weight-⌊δn⌋ syndrome ball. Equivalently, the multiplicity profile of the ratio sequence of two GRS syndromes — a Littlewood–Offord problem in F_q.
5.2The named residuals
Everything conditional in §3 is conditional on an explicit, grep-able object. There are three that matter:
CellPackageSupply(Hab25JohnsonPackageSupply) — the BCIKS20 §5 per-cell heavy-agreement package. The full consumer chain to the Johnson discharge is proven; the package itself, whosehmonicfield is the identified design tension, is not.RegimeIIIGoodness(KKH26RegimeSplit) — the type-enforced gap between the Johnson lane and the pin. A proven weld shows that even a full Johnson discharge covers only , and that regime III is nonempty at every live parameter point: the Johnson lane alone never reaches the pin. This honesty is itself a theorem.WindowRationalBounded(WBPencilBelowUDR) — the below-UDR residual: doubly WB-solvable stacks carry at most bad scalars. Proven below the ladder reach, probe-supported in the window at two scales, with the extremal (Möbius-symmetric) adversary explicitly located and the case fully closed.
5.3What the wall is, in one sentence
Via the formalized CS25 coupling[CS25], the window’s sup side is equivalent in its regime to beyond-Johnson list decoding of explicit smooth-domain RS codes; the same wall reached from six independent directions (the KKH26 census, the characteristic-zero collision law, pencil moments, additive energy, vertical thresholds, divisibility events) presents each time as the same quantity: per-frequency cancellation past on smooth multiplicative subgroup character sums. The campaign did not breach it. It measured it, formalized its shape, and arranged the architecture so that any future breakthrough lands as a single lemma.
6Discussion: kernel-checked grind is unfakeable
What does it mean that any of this worked? Not that machines replaced mathematicians, but that verification stopped being the bottleneck, and the moment it did, mathematical progress started behaving like a function of compute. That is the part that generalizes.
The headline of this report is not any single theorem. It is the shape of the ledger. In a matter of weeks, an agent fleet took a problem on which the published frontier had not produced one exact value in twenty-five years and left behind: the first exact thresholds for any code, a universal structural law with its regime boundaries identified and proven, an infinite calibration family of in-window pins, a complete unconditional classification of the window, two corrections to the literature’s own proof chain, and a map of the open core sharp enough that the remaining mathematics fits in three named Props.
The standard objection to LLM mathematics is that language models confabulate, and the objection is correct. The campaign’s answer is architectural, not behavioral: no claim exists unless the Lean kernel accepts it, and the axiom census of every theorem is one command away. The agents tried wrong things constantly; twenty-eight refuted hypotheses and a public log of retracted overclaims say so. None of that survives into the results, because the medium the results live in cannot hold a falsehood. Confabulation at the search layer is harmless, even useful, when the publication layer is a proof checker.
Under that contract, progress becomes a function of compute in a concrete sense. Hypothesis generation, probe design, formalization, and red-teaming all parallelize across agents; the kernel serializes nothing but truth. The interesting economics: a refuted hypothesis costs roughly as much as a proven theorem and is worth nearly as much, because it permanently shrinks the search space for every subsequent agent. Mathematical knowledge here compounds the way a codebase does, and the disproof log is as load-bearing as the theorem files.
We are explicit about the limit case. The wall of §5.3 is a genuine mathematical barrier; nothing about this method guarantees it falls to more compute, and the campaign’s own no-go lemmas prove that several natural compute-shaped attacks cannot work. What the method changes is the cost structure around the wall: everything provable short of it gets proven, the wall’s exact shape gets formalized, and the problem is left in a state where a single new idea, human or machine, has a prepared place to land. That is what “honestly mapping the unknown” means, and we believe it is the durable contribution: a demonstration that famous open problems can be industrialized without being inflated, because the kernel does not grade on a curve.
Everything on this page is reproducible from lalalune/ArkLib: clone, build, and run #print axioms on any theorem named above. The campaign log is public at issues #232, #334, #357, and #371.
Campaign timeline
Issue #232
The disproof campaign: attack the ABF26 Grand Challenge conjecture itself, keep every constraint lemma. 23 verified bricks; the Johnson-threshold carving matches what the literature later confirmed.
Nov 2025
Three independent groups refute the at-capacity conjectures (CS25, KK25, DG25). The window becomes the problem. The refutations are formalized in-tree.
Issue #357 opens
Two parallel nine-hypothesis slates under the standing discipline: constraints, why-nobody, larp-check, falsification probe, then formalize.
First pin
δ*(RS[F₅, F₅ˣ, 2], 2/5) = 1/4 — the first exact MCA threshold for any code, landed in two independent lanes within hours of each other.
The staircase
Band collapse at b = 2, then b = 3, then all bands at the 3b−2 threshold — sharpened from 4j to 3j during formalization. The MDS rank conjecture dies by pencil explosion; 3b−2 is the law.
The census programme
Bad scalars become subset-sum combinatorics; the wide-circuit matroid census closes with a machine-generated 1,571-line converse proof emitted from 10,395 probe certificates.
The Johnson lane
The BCIKS20 §5 discharge chain driven to a single named residual through two machine-checked corrections of the paper's own recursion (findings 13 and 14).
Issue #371: the WB programme
Badness becomes pencil linear algebra. Six theorems close the below-UDR branches; the window adversary is found and is Möbius-symmetric; PGL₂ equivariance is proven.
The unification
The ownership bound contains both lanes as instances. The k = 1 window closes unconditionally by direction class: polynomial (zero), rational (multiplicity theorem), sparse (popularity).
Today
The frontier is formalized; the open core is three named Props and four equivalent faces. The window at production rate is open.
The humans and the machines
The proofs were written by an LLM agent fleet and checked by the Lean 4 kernel; the humans below steered the campaign, reviewed the work, and built the formal foundation it stands on. The models did the proving. The kernel did the believing.
Campaign · lalalune/ArkLib
lalalune carries the overwhelming majority of fleet commits.
Foundation · built on Verified-zkEVM/ArkLib
The campaign is a fork. Everything here composes against the formal foundation laid by the upstream ArkLib team:
References
- [ABF26]G. Arnon, D. Boneh, G. Fenzi. Open Problems in List Decoding and Correlated Agreement. IACR ePrint 2026/680. Definition 4.3 is the in-tree ε_mca. eprint.iacr.org/2026/680
- [BCIKS20]E. Ben-Sasson, D. Carmon, Y. Ishai, S. Kopparty, S. Saraf. Proximity Gaps for Reed–Solomon Codes. IACR ePrint 2020/654. Correlated agreement up to the Johnson radius; the §5/Appendix A chain the Johnson lane discharges. eprint.iacr.org/2020/654
- [BCHKS25]E. Ben-Sasson, D. Carmon, U. Haböck, S. Kopparty, S. Saraf. Proximity gaps stop at the Johnson bound. ECCC TR25-169. The coupling barrier: MCA past Johnson implies beyond-Johnson list decoding. eccc.weizmann.ac.il/report/2025/169/
- [KKH26]Kambiré, Kopparty, Haböck (KKH26). IACR ePrint 2026/782. The near-capacity strip exclusion: δ* ≤ 1 − r/2^μ for explicit smooth evaluation codes. eprint.iacr.org/2026/782
- [Hab25]U. Haböck. IACR ePrint 2025/2110. The streamlined Johnson-radius MCA lane (the floor of the window). eprint.iacr.org/2025/2110
- [CS25]A. Crites, A. Stewart. IACR ePrint 2025/2046. With KK25 and DG25: mutual correlated agreement fails at capacity; the reduction coupling upper-bound progress to list decoding. eprint.iacr.org/2025/2046
- [DG25]B. Diamond, A. Gruen. IACR ePrint 2025/2010. Super-polynomial proximity-gap error at vanishing rate; one of the three independent at-capacity refutations. eprint.iacr.org/2025/2010
- [GS99]V. Guruswami, M. Sudan. Improved decoding of Reed–Solomon and algebraic-geometry codes. IEEE Trans. Inform. Theory 45 (1999). List decoding to the Johnson radius — the engine behind every in-tree polynomial list bound, and the 25-year boundary.
- [GZ23]Z. Guo, Z. Zhang. Randomly punctured Reed–Solomon codes achieve list-decoding capacity over polynomial-size alphabets. arXiv:2304.01403 (FOCS 2023 line). arxiv.org/abs/2304.01403
- [CZ25]Y. Chen, Z. Zhang. Explicit folded Reed–Solomon codes achieve capacity with optimal list size. STOC 2025. Why folding works and the plain smooth domain resists.
Seventeen papers are wired into the formalization in total; the complete inventory with in-tree consumers is docs/kb/deltastar-research-map.md.