[ omega capstone · v4.0 · rigor-applied ]

A distributed digital shadow of UPS hub operations.

A coordinator's architecture, from the sort aisle to a calibrated network standard — built from the four exports every hub already produces, with no system integration and no special access.

// abstract

This is the document of a working digital shadow of the UPS Chelmsford Twilight sort, built from the operations side — and it is also the pitch. Working only from SOR, iGate, SEAS, and CURE, a year of exports across 254 sort days yields a small set of structural constants that describe the building's routing geometry, and those constants drive a staffing engine, a parametric-bootstrap simulator, an idealized shadow, a local model, and a knowledge graph. The product is the measurable distance between the engineered ideal — the MSD standard, in work units — and as-built reality, decomposed into a structural floor and a closeable slack. The thesis is simple: anyone can wire exports into a dashboard; what is not generic is a coordinator treating his own operation as a mathematical object, choosing estimators that respect its structure, and grounding every fit in operational truth. This v4.0 head is the rigor-applied reconciliation of three prior documents.

/ 01

The digital-shadow thesis

This is the architecture of a working digital shadow of the CHEMA Twilight sort, built from the operations side using only the data the building already produces. It was not built at an Industrial Engineering desk or in a corporate analytics group; it was built on the floor, on the four exports everyone already has, and that origin is the most interesting thing about it. The smallest unit is a single package and the decision about which belt it belongs on; the largest claim is that the same method, run across many hubs and the day, twilight, and night sorts inside each one, gives the company the empirical basis to calibrate its own engineered productivity standard.

One framing runs through all of it. The routing was engineered above me; what I built is a way to read it out of the exhaust the building already produces, and to measure the gap between the engineered ideal and what actually happens. That gap is the product. Everything else exists to make it honest. The model is explicitly a shadow: data flows in, no control surface flows out. It is an observer, not a controller — the precise sense in which it is a shadow rather than a digital twin.

The instruments carry a lag topology, $K_{\mathrm{live}} \subset K_{\mathrm{sornight}} \subset K_{\mathrm{nextday}}$: live iGate summaries, then SOR and prior-day CURE, then SEAS, LIB, and misload the next day. The coordinator operates in $K_{\mathrm{sornight}}$ during the sort; the post-sort dashboard operates in $K_{\mathrm{nextday}}$. Mixing latency layers produces systematically misleading numbers — a discipline most reporting ignores.

/ 02

The per-building fingerprint

Each building's geometry shows up as a small set of structural constants — its fingerprint. Each is a per-sort ratio summarized across the year, governed by a single source of truth, metric_contract.json. Where a constant is not level-stable but day-of-week-dependent ($\kappa$) or season-conditioned ($\gamma$), this head says so explicitly — pretending a varying quantity is a tight scalar is exactly the overconfidence a year of data caught.

Zone-3 share of PD-belt volume (PD-09–12). A DOW-indexed vector on the SEAS basis, not a flat scalar.

kappa_Z3(DOW)

Mon 0.413 · Fri 0.322

Weekly volume decay, full 254-sort corpus (Fri/Mon 0.9297); Friday ≈ 93% of Monday.

gamma

γ = 0.982 ± 0.010

PD-belt share of total hub volume — a volume share, not a correlation. Staffing sizes against V·ρ.

rho_PD/Hub

ρ = 0.509 ± 0.025

Total building PPH (direct + indirect, all roles). Cost field c = 1/Π. Not per-loader PPH.

Pi_SOR

Π ≈ 120.3 · c = 1/Π

The Zone-3 share $\kappa_{9\text{-}12}$ is the fraction of PD-belt volume landing on PD-09 through PD-12, and the canonical object (kappa_dow_steady) is the SEAS-basis DOW vector: Mon 0.413 · Tue 0.395 · Wed 0.385 · Thu 0.380 · Fri 0.322, full-year mean $\approx 0.368 \pm 0.014$. The iGate basis sits ~0.04–0.06 lower at every phase (≈ 0.31–0.33) — this is what the historical "0.311" measured. The two bases differ by the CCHIL → PD-09 attribution rule, not by phase scope: SEAS bundles all CCHIL volume under PD-09, iGate distributes by physical scanner. A third independent path — a 345,131-row ZIP→SLIC→belt routing table with no scan data at all — gives 0.293. Three roads that do not talk to each other land within a few points of one another.

// eq. 2.1 — staffing sized against the PD share $$ n_{\mathrm{total}} = \left\lceil \frac{V \cdot \rho}{\mathrm{PPH}_{\mathrm{loader}} \cdot t_{\mathrm{sort}}} \right\rceil $$

The weekly decay $\gamma$ recalibrated to 0.982 ± 0.010 on the full 254-sort corpus (Fri/Mon ratio 0.9297, so $\gamma = 0.9297^{1/4} = 0.9819$): Friday runs near 93% of Monday. It had looked tighter near 0.958 on six weeks of data; the full year revealed roughly twelve times the spread, and $\gamma$ is honestly a season-conditioned distribution. The PD/Hub share $\rho = 0.509$ is why staffing is sized against $V \cdot \rho$, never raw building volume. The total building PPH near $\Pi_{\mathrm{SOR}} \approx 120.3$ (all hours, all roles, cost field $c = 1/\Pi$) is distinct from the per-loader rate near 385 and from "Direct PPH" near 124 — the three must never be conflated.

/ 03

The analytic layer — what earns its place

The operational story rests on an analytic layer built and refined deliberately. The point is not that the mathematics is exotic — it is that the mathematics is load-bearing, and a coordinator put it there on purpose. A sort is described by five operating quantities ($V$, $H$, $N$, $\Pi$, $S$) tied by exact identities $\Pi = V/H$, $P = H/N$, $\Phi = V/S$, $S = P$ — four equations in five unknowns, so the feasible set is not a point but a one-parameter family, a smooth 1-manifold by the implicit function theorem. The MSD column over-determines the system and collapses that manifold to a single corporate point; the empirical operation lives elsewhere on it, and the displacement is the attainment multiplier $\mu = \Pi_{\mathrm{actual}}/\Pi_{\mathrm{MSD}}$.

The one validated structural object retained is the ZIP ultrametric. Destinations are ZIP codes, and the natural metric is the prefix metric — two ZIPs are close when they share a long leading prefix. That is genuinely an ultrametric, a non-Archimedean tree distance, and the consequence is concrete and acted upon: the routing function $f : Z \to B$ is a coarsening of that tree onto sixteen belts, the CCHIL belts are where the coarsening is multivalued, and seeing $Z$ as a tree rather than a flat list of ~41,000 codes is what makes the quiz-weighting and the SLIC-to-belt function behave correctly under aggregation. This abstraction earns its place.

The differential-geometry / sheaf / free-energy / symplectic material is analogy, not derivation, and is treated as future work. The de Rham "incompressible flow" reading of conservation, the Frobenius-foliation framing (trivial at rank 1), the sheaf/colimit reading of the two κ views, the fiber-bundle reading of seasonality, and the free-energy reading of the cost field each suggested a useful construction — but none carries a theorem proved here, so none is load-bearing. The ZIP ultrametric is the one validated structural object retained; everything else in that family is disciplined scaffolding that pointed at the right plain construction.

What is kept as load-bearing besides the ultrametric: the conservation identity $V = \sum_i V_{\mathrm{PD},i} + \ldots$, whose real consequence is that per-belt allocation must be sampled jointly on the probability simplex (a Dirichlet-multinomial), not as independent marginals; the cost field $c = 1/\Pi$ as a plain scalar over the manifold; and the sorter-as-channel model $\mathbf{M} = \mathbf{M}_K \cdot \mathbf{M}_E$ — explicitly a composition of two separately measured channels (LTC measures the knowledge gap, OJS measures execution error), not a recovery of two factors from one observed matrix, which would be non-identifiable.

/ 04

Toward a network standard

This is the terminal operational result. MSD is the as-designed engineered ideal in work units; the corpus is the as-built reality; the system makes the distance between them measurable, explainable, and decomposable. The gap to ideal splits into two parts: a structural floor — what the building cannot exceed given its $\kappa$, $\gamma$, $\rho$ geometry, freight composition, phase structure, and arrival timing — and a closeable slack, the attainable recovery in internals rework, double-handling, and fatigue, attacked by LTC and staffing. That decomposition turns MSD from a flat number people argue about into a target with a map: this much is the building, and this much I can actually recover.

The verb matters. MSD is a Corporate IE standard; this work does not rewrite it. What it produces is the empirical evidence base IE would need to condition it — a measured standard that knows about season, phase, zone, and composition, sitting next to the engineered one. The memory-and-local-model pattern is how this scales without a cloud or an IT project: each hub runs the same four exports, the same pipeline re-calibrates in an afternoon, and a distributed coordinator-contributor network produces the cross-hub corpus that is the natural training set for a network-scale model.

The point that makes it a network standard rather than one ideal applied everywhere: the per-building fingerprints — each hub's own $\kappa$ vector, $\gamma$, and $\rho$ — let the standard itself be conditioned on building type. That is the prerequisite for fair automated targets. Automation optimizes execution while this calibrates the target; a faster belt raises $\Pi$, but raising $\Pi$ against a one-size-fits-all standard just moves you faster toward the wrong point on the manifold. The structural-versus-closeable decomposition is what tells automation where the recoverable slack actually is.

/ 05

Rigor & open problems

v4.0 supersedes and consolidates three prior documents — the original omega_whitepaper.md, the DeepSeek revision (the cleanest base), and the induction-fix κ-conformance variant — into one authoritative head, applying the 2026-05-27 rigor audit against the ratified constants in metric_contract.json. What changed in this pass:

γ fixed to the canonical 0.982 everywhere. The prior heads narrated the recalibration while still printing the pre-revision $\gamma \approx 0.951$–0.958 and its consequence "Fri ≈ 82% of Mon" — the self-contradiction the audit flagged as a blocker. The DOW-decay figure was re-rendered at γ = 0.982 (Friday ≈ 93% of Monday). The Appendix A vs Appendix B κ contradiction was resolved by stating $\kappa$ as a DOW vector on a named denominator and explicitly labeling Appendix B as the SEAS basis, so the two appendices no longer disagree. Decorative geometry demoted to analogy — Frobenius integrability, the de Rham 0-form / "incompressible flow" framing, the sheaf/colimit metaphor, and the free-energy / finite-temperature language are collected under an explicit analogies note; only the ZIP ultrametric is kept as load-bearing.

Overstated "results" downgraded to hypotheses. The missort factorization $\mathbf{M} = \mathbf{M}_K \cdot \mathbf{M}_E$ is reframed as a model of composition, not a recoverable decomposition (non-identifiable). The Fidelity Cascade and $\partial\mathbb{E}[\mathrm{LIB}]/\partial\Pi < 0$ are demoted to design hypotheses with directional evidence, not proven bounds. The Zone Social Potential is presented as a management heuristic with no operational definition yet. The ~300-dimensional EKF idealized shadow is written as a proposed architecture with no performance numbers claimed, consistent with the finding that it was never built.

Honestly disclosed open items: the third independent κ path (0.293) and the ~3,400-node / ~5,000-edge knowledge-graph counts are reported as the project's numbers, not independently re-derived here; the remaining wp_figs/* figures beyond the DOW figure were not regenerated this pass and should be spot-checked before publication; and κ_Z1 and κ_Z2 remain uncomputed (a data gap, labeled as planned work).

Rigor audit applied 2026-05-31 (verdict: major revision, closest to publishable). Reconciles the original, DeepSeek, and κ-conformance heads into one. Constants conformed to metric_contract.json (γ = 0.982; κ_Z3 SEAS DOW vector mean 0.368 ± 0.014; ρ = 0.509; Π_SOR ≈ 120.3, c = 1/Π). This is an internal operations-research and career capstone, not a peer-reviewed paper — it states the sample size behind its numbers and the uncertainty around them, and labels analogy as analogy.