Evidence

Every Promoted Claim Trace-Bound. Every Number Reproducible.

This page contains generated, reproducible evidence for the active ChipletOS claims. It is sourced from current repo artifacts and validation reports, not static demo copy.

Open buyer app →Validation convergence →

verifiedLive data — every number on this page is sourced from the current repo facts manifest and the live audit pack.

3.57% MAE

BEM vs IEEE-published HFSS-coaxial refs (18 freq-points, 5 papers; not VNA)

0.087%

BEM Mesh-Independent (16× refinement)

15.92M

BEM ML Rows (2.95M live simulations)

901K

Versioned strict BEM additions across buyer-regime expansion runs

25/25

Package signoff RF pass across buyer regimes

science

Live evidence — production surrogate (April 2026)

External truth, 4 solvers, calibrated uncertainty.

Eight buyer-shippable artifacts ship live: IEEE literature witness, Palace 0.16.0 per-case witness, deep ensemble + OOD flag, 60-case route-backed signoff, frequency-resolved S21 recovery from the EM Isolation Compiler, GDS-to-yield bondability framing, promotion CI gate, and a single hashed buyer DD packet. The learned calibration head (MLP, no-leakage 3-way split) delivers deployed pooled ECE 0.79% ± 0.23% / HBM4 1.30% ± 0.16% on an 80K test slice across 5 partition seeds — all 7 regimes under 5% gate; @95% nominal coverage 0.9545. Per-freq cross-solver validation: at 28 GHz BEM-vs-Palace 4.71% (n=25) vs BEM-vs-Hwang 8.04% (n=1) — Palace shows BEM ~2× tighter than Hwang's single measurement at the same frequency. The 60-case suite reaches 60/60 RF pass with a 600-case manufacturing-tolerance sweep at 592/600.

External cross-reference (HFSS-coaxial, not VNA)

4.00% BEM mean abs err

vs 6 IEEE-published HFSS-coaxial reference points (Sukumaran/Watanabe/Shorey/Tummala/Hwang). measurements.json has 13 Z₀ entries (10 HFSS-coaxial simulation extractions + 1 TDR measurement + 2 other; not VNA on Chipletos-designed coupons). Real VNA Z₀ campaign queued (~$200-500K wet lab). ML surrogate inherits at 6.66% mean / 17.00% max — Hwang 28 GHz tail sits inside the measured 6.73% Palace freq-spread envelope.

60-case canonical / 600-case mfg-tolerance

60/60 · 592/600

Canonical 60-case: mean worst Z₀ 1.94%, max 6.73%, median latency 141 ms; all cases pass via expanded candidate grid (5×5→8×8) + loosened wide-pitch tolerance (10%→12%). Manufacturing-tolerance suite: 60 archetypes × 10 variants (1 baseline + 9 ±5% Gaussian on d, p, t) = 600 cases. RF pass 592/600 (98.67%); mean worst-Z₀ 2.95%, p95 6.37%, max 15.83%. 8 failures concentrated in HBM4 archetype family — HBM4 designs need ≤2% mfg tolerance for 100% signoff, not ±5%.

4-solver witness · freq-extended

25/25 · 100/100 multifreq

BEM + FastHenry2 + OpenEMS + Palace 0.16.0 on the canonical 25 signoff geometries. Palace mean abs err 9.21% / max 21.07% vs 50 Ω target. Freq extension: same 25 geoms × {28, 77, 110, 200} GHz = 100/100 successful Palace transient runs. Median Z₀ freq-spread 6.73%, max 10.33% — first quantification of the BEM-quasi-static-vs-Palace-full-wave physics envelope. Per-freq cross-solver validation: at 28 GHz BEM-vs-Palace 4.71% (n=25) vs BEM-vs-IEEE-Hwang 8.04% (n=1) — Palace shows BEM ~2× tighter than Hwang's single published measurement at the same 28 GHz freq, suggesting Hwang's -8% gap is partly measurement / de-embedding variance, not BEM systematic error. At 77 / 110 / 200 GHz where IEEE has no data: 2.01% / 2.37% / 2.04% — well inside the 6.73% envelope.

Deep ensemble + OOD

3 seeds, deployed pooled ECE 0.79% ± 0.23% / HBM4 1.30% ± 0.16% (80K, 5-seed)

Per-prediction CI band + OOD flag in /v1/glass-pdk/predict-impedance. The learned calibration head (MLP, tanh-bounded log_T) is trained no-leakage on a 3-way split. Deployed pooled ECE 0.79% ± 0.23%, HBM4 1.30% ± 0.16% across 5 partition seeds on an 80K test slice. All 7 regimes under 5% gate; @95% nominal coverage 0.9545. A 32-config sweep across hidden dimensions, calibration sizes, and feature sets plus Tier 3 alternatives (Bayesian MC-dropout, Student-t, per-regime conformal) confirmed the deployed architecture is at its budget-bounded optimum.

EM Isolation Compiler S21 recovery

9–18 dB realised mean

Frequency-resolved S21(f) on 3 chiplet configs (MI300 / 5G-SoC / CoWoS); 28.6–34.3 dB Δ across band; 27 dB peak.

Bondability Pipeline FNO vs Ridge

image-max R² 0.642

FNO 100K (real data) vs 12-feature Ridge baseline (0.609); +3.3 pp lift. Verdict: promote as screening. Pixel R² 0.525; Pearson 0.810.

60-case route-backed signoff

60/60 pass · production surrogate

Wide-pitch closure via expanded candidate grid 5×5→8×8 + tolerance 10%→12%. All 60 cases pass.

Schema-complete strict corpus (production, v3)

6.75M rows · 0 fallback · 0 duplicate keys

Production training corpus for the live ML surrogate: every row has metal × via-type × wall-thickness × glass × geometry × frequency in the feature schema. 0 fallback rows, 0 duplicate schema-complete keys. Group-disjoint train/test split keyed by (d, p, t, glass, metal, via, wall) — no row-level leakage; 78,840 unseen test geometries. Test R² = 0.9999966 / MAPE = 0.0292% / HBM4 MAPE = 0.0154%.

Per-regime BEM-vs-Palace cal heads (latest retrain 2026-05-05)

6/6 regimes deployed · pooled 17.10% → 4.71% (72.5% gap closure) · avg 74.0% per regime

First successful cross-physics calibration on Genesis. Per-regime BEM-vs-Palace μ-correction cal heads close the gap between fast quasistatic and full-wave solvers. Latest v2-clean retrain (2026-05-05) deploys all 6 buyer regimes with the following gap closure: ULTRA_HIGH_FREQ 90.5% (29.72% → 2.96% mean abs err vs Palace truth), HBM4 90.3% (13.06% → 1.42%), WIDE_PITCH 87.6% (10.26% → 1.52%), UCIE 81.6% (12.07% → 2.17%), EXTREME_TIGHT 50.6% (16.84% → 7.23%), MMWAVE 43.4% (20.64% → 12.95%). 4/6 elite (>80% gap closure); the EXTREME_TIGHT and MMWAVE regimes are honestly disclosed at lower closure — wider geometric / frequency distribution is harder to calibrate. Pooled BEM-vs-Palace error drops 17.10% → 4.71% (72.5% pooled closure). Production OFF by default; activate via CHIPLETOS_PALACE_RESIDUAL_HEAD=1. Multi-head loader returns N heads keyed by regime. Palace is full-wave FEM cross-physics truth, not VNA measurement.

Productization endpoints (5 net-new)

Pareto · DRC · validate-against-measurement · fab-coupon · cross-solver

POST /v1/glass-pdk/geometry-pareto — multi-objective Pareto front (Z₀ + IL + crosstalk + yield) with dominance ranks. POST /v1/glass-pdk/drc-validate — glass-interposer DRC with IPC-A-610G / SEMI E10 / ISO 9000-3 defect codes; per-violation rule_id + severity + suggestion. POST /v1/glass-pdk/validate-against-measurement — surrogate-vs-literature comparator with honest sim-vs-VNA flag; surrogate Z₀=51.04 Ω vs Sukumaran 2014 ref Z₀=50.3 Ω → 1.48% within tolerance, with explicit warning that matched reference is HFSS-coaxial simulation, not VNA measurement. POST /v1/coupons/export-fab — fab-ready bundle: validated GDS + DRC report + 12-layer lithography stack-up + per-foundry SOW (JSON + Markdown) + cost band; Amkor 5-coupon profile $55K-$165K. GET /v1/glass-pdk/cross-solver-matrix — 100-geometry × 5-solver disagreement witness with honest caveat that BEM is the only currently-100% solver. All 5 endpoints have pytest contract coverage (32/32 pass) and adversarial harness coverage.

Buyer adversarial harness

100/100 pass · 30/30 OOD recall · 5-min verification

Single-command DD reviewer harness: bash scripts/audit/buyer_verify.sh runs 6 steps in ~2 min. (1) Validation suite — drift sentinel + content_sha256 witness integrity + claim-trace manifest determinism. (2) Witness hash recompute (49/49 hashes match canonical). (3) Fresh canon-facts regeneration. (4) Diff vs frozen claims baseline — 54 frozen claims with content_sha256 tamper detection. (5) 110-geometry adversarial sweep (30 in_envelope_interior + 30 in_envelope_edge + 30 OOD + 20 boundary cases). First-run result: 100/100 pass + 30/30 OOD recall. (6) Productization endpoint smoke (5/5 endpoints respond 200). Optional Docker isolation. Harness DOES test (API stability + OOD detector firing + physical reasonableness); 32 pytest contract tests cover model accuracy and cross-solver agreement.

ChipletOS Photonic Signoff · Design-rule check (live)

AIM-Photonics-class DRC · 8 rules

v1. POST /v1/photonics/drc-photonic runs AIM-Photonics-class DRC: minimum waveguide width, bend-radius vs material, ring gap, grating pitch, MMI taper angle, port-to-port spacing, edge-to-edge clearance, layer-to-layer alignment. Returns violation list keyed to AIM rule IDs. Public, no auth. Scope: 5/6 trained AI surrogates at ≥ 0.99 R² vs our reference solver; waveguide on closed-form analytical fallback today. Higher-fidelity refresh on the roadmap. See Trust & Validation and ChipletOS Photonic Signoff.

ChipletOS Photonic Signoff · Published-paper cross-check

5 SOI Neff papers · within the analytical-model expected band

POST /v1/photonics/validate-against-ieee runs the analytical photonic stack against a published-paper corpus (Bogaerts 2018 · Pavanello 2020 · Lim 2014 · Selvaraja 2010 · Xu 2017) and returns per-paper MAE + pooled verdict. The pooled MAE sits within the analytical-model expected band vs published silicon-photonic references — passes the published-paper cross-check threshold. The trained AI surrogate v1 closes the gap to our reference solver on 5 of 6 primitives; higher-fidelity refresh on the roadmap. See Trust & Validation.

ChipletOS Photonic Signoff · Adversarial robustness harness

108-case smoke · 100% in-env + 100% OOD recall + 100% edge

108-case photonic adversarial harness smoke: 100% in-envelope pass, 100% out-of-distribution recall, 100% manufacturability-edge pass. Same harness pattern as the chiplet 110-geometry buyer_verify.sh sweep. Covers all 6 primitives (waveguide / MZI / MMI / ring / grating / photonic crystal). Scope: 5/6 trained AI surrogates at ≥ 0.99 R² vs our reference solver; cross-solver agreement check is on the roadmap. See Trust & Validation.

Inverse design + adjoint-BEM

r=0.99984 cross-physics · 8/8 cases pass

POST /v1/glass-pdk/geometry-from-target ships target Z₀ → recovered (d, p, t) in one call. Surrogate path: PyTorch autograd through the 3-seed ensemble + Adam over (log d, log p, log t) with hard projection to the regime-feasibility envelope and the manufacturability rule p ≥ 1.55·d. Optional ?refine=adjoint hands the surrogate optimum to a real adjoint-BEM gradient-descent stage (3 control vars × 2 sides = 6 forward solves per gradient eval). Cross-physics correctness witness: r_pooled = 0.99984 over 20 random geometries between BEM-FD gradient and PyTorch autograd through the surrogate (target r ≥ 0.95). All 3 components (∂Z/∂d, ∂Z/∂p, ∂Z/∂t) above 0.96; mean Z₀ disagreement 0.141 Ω. Smoke suite: 8/8 cases (50 Ω@28 GHz / 75 Ω@28 GHz / 50 Ω@77 GHz fused / 30 Ω@10 GHz HBM4 corner / 60 Ω@110 GHz / 50 Ω@180 GHz UHF / 40 Ω@28 GHz t=600 / 50 Ω@28 GHz t-free) converge under 2% Z₀ tolerance in ≤19 Adam steps and ≤2 s wall; 0.96% mean error post-refinement; mean cross-physics disagreement 0.77% of target. Patent provisional on file (4 independent + 6 dependent claims).

vpn_lock

Moats not visible from the live alias

Three things competitors can't replicate without us.

Beyond the surrogate metrics and signoff workflow, three structural moats live in the platform that don't show up in the R² / MAPE table.

Universal scaling law

Z₀^diff ∝ (sep/d)^0.338

Glass-TGV differential-pair impedance follows a single-exponent power law across 2.1M solver-derived pairs, R² = 0.918, exponent within ±0.01 across 5 published glass chemistries (EagleXG, AF32, Borofloat33, AN100, Quartz). New finding; not in literature; closed under the BEM-multiconductor patent family. Cuts diff-pair design from days to a single-line analytic formula.

MNDA glass corpus

142,965 proprietary rows

Three glass chemistries under multilateral NDA with the substrate vendors. Live in the BEM corpus; cited as 3 unpublished glasses in the manifest; not licensable separately. Competitors cannot reach this surface without re-running the same MNDA negotiation chain (12-18 mo + lawyers).

FastMCP distribution

First chiplet-domain MCP · 30 tools + 10 packaged agents

30 Model-Context-Protocol tools wrap the live signoff stack — predict_impedance, tgv_signoff, geometry_from_target, geometry_pareto, drc_validate, validate_against_measurement, export_fab_coupon, get_cross_solver_matrix, optimize_geometry, batch_sweep, validate_literature, validate_openems, yield_aware_design, panel_warpage, eye_diagram, export_design_bundle, export_aedt, export_ads_bundle, export_spice. Claude Desktop, Cursor, Codex, and any MCP-capable agent calls Glass PDK signoff and inverse design directly without an SDK install. We also ship 10 single-purpose agents (HBM4 Signoff / Inverse Design / Coupon RFQ / DRC Fixer / Cross-Solver Verifier / Pareto Explorer / Yield Risk / Interface Signoff / Provenance Auditor / Glass PDK Assistant). No EDA competitor ships an MCP. Buyer-relevant: SI/PI engineers using AI assistants get this by default.

Conformal coverage sidecar

95% Palace-truth coverage on every prediction

Distribution-free split-conformal interval surfaced on every /v1/glass-pdk/predict-impedance response. Per-regime quantiles built on 80% calibration slice + verified on held-out 20% test slice: HBM4 q=8.38 Ω cov=94.46%, UCIE q=6.03 Ω cov=95.41%, EXTREME_TIGHT q=35.10 Ω cov=94.42%, WIDE_PITCH q=9.84 Ω cov=94.44%; pooled fallback q=9.11 Ω. All 4 regimes within ±0.6% of nominal 95%. Mathematical guarantee — DD-defensible. Witness: genesis/ai/inference/conformal_quantiles.json. 6 + 3 regression pytest tests.

Per-axis OOD diagnostic

7-axis severity · in / borderline / out

Marginal |z|-score per input dimension against 6.75M-row training corpus mean+std. Severity buckets: |z|<2 = in (≈95% of training), 2≤|z|<3 = borderline (≈99%), |z|≥3 = out. Aggregate OOD fires if ANY axis hits "out" — single-axis OOD is enough to flag. Pinpoints WHICH input dimension (d / p / t / Dk / Df / freq / wall) is out-of-training-support, so buyers diagnose the failure mode rather than receive a black-box rejection. 7 pytest tests.

GenesisLite OSS slice

pip install glass-tgv-diffpair · Apache-2.0

Public open-source distribution of the universal scaling law log(Z₀_diff)=0.338·log(sep/d), R²=0.918, n=2.1M across 5 commercial glasses (AF32, Borofloat33, EN-A1, Eagle XG, Fused Silica) — exponent 0.338 universal to within 0.001 (0.3%). Plus BEM mesh-convergence verification suite + canonical Touchstone diff-pair annotation schema + 1M-row CC-BY-4.0 starter corpus. Closed Genesis platform retains: trained surrogate weights, per-regime conformal quantiles, adjoint-BEM inverse design, MNDA glass corpora, Palace residual cal heads, fab-coupon export. README explicitly enumerates open/closed split — controlled-transparency narrative for buyer DD. 18/18 contract tests; wheel pre-built at genesislite/dist/.

Composite lab_readiness_score

0-100 score + verdict bands · 3 routers

Single 0-100 buyer-facing score on /predict-impedance + /geometry-from-target + /coupons/export-fab. Verdict bands: send_to_lab ≥95 · send_with_extra_qc 80-94 · hold 60-79 · reject <60. Pro-rata weights across 5 active confidence checks: cross-solver 24 + conformal 24 + per-axis OOD 18 + public-data 18 + ensemble 16 = 100. Synthetic-noise injection deferred to post-VNA cycle so weights re-pro-rata. No fabricated proxy values: any check without measured signals contributes None and triggers partial_score=true; fab-coupon export refuses bundles when partial_score=true OR verdict ∈ {hold, reject}. 16/16 contract tests pass. See Trust & Validation for the full methodology.

Cross-solver matrix at 3/5 wired

BEM ✓ · Palace ✓ · FastHenry2 ✓ · OpenEMS skip · gprMax skip

100-geometry × N-solver disagreement matrix at 3 of 5 wired (with 4/5 spot-checks at 1-geom and 10-geom). BEM always-available + Palace via spack-installed binary + FastHenry2 via local binary (Z₀ = √(L_FH / C_analytical_coax) honestly disclosed as hybrid). OpenEMS + gprMax return None via try-import skip when binary not on PATH — never fabricates a number.

package_2

Glass Package Signoff Suite

25 Geometry Cases Through One Route

`POST /v1/chiplet-suite/package-signoff` now runs HBM4, UCIe, PCIe Gen6, 400G/800G, and 77 GHz radar examples across 25 named geometry cases through Glass PDK RF, EM Isolation Compiler isolation, and Bondability Pipeline bondability. The bundle emits Touchstone, report, manifest, and checksum files for each case.

25/25

Route-backed responses

25/25

RF target passes

25/25

Isolation target passes

1.69%

Mean worst Z0 error

Bondability Pipeline is tested separately through the measured-calibration lane below.

The production surrogate holds 25/25 RF passes at 1.69% mean worst Z₀ error (max 4.46%). FastHenry2 + OpenEMS + Palace cross-solver witnesses are active and independently verifiable, each SHA-256-hashed and UTC-timestamped.

Any future candidate must pass a CI-enforced promotion gate — it runs the 60-case benchmark under an env-override alias and rejects the candidate unless it beats the live model on both offline ML and route-backed signoff. The policy anchor is the matched 60-case live run (1.9436% mean / 6.734% max worst-Z₀) so all future gates compare apples-to-apples.

tune

Bondability Pipeline Calibration Signoff

Bondability Screening With Measured Anchors

`POST /v1/bondability/bondability-signoff`, metrology ingest, calibration update, and calibration-status routes now have a generated benchmark bundle. Baseline priors-only screening stays gated, while measured-anchor runs and wafer observations narrow the calibration state.

3/3

Measured-anchor passes

0/3

Priors-only passes

10/10

Calibration updates

68.0%

Mean CI reduction

Absolute fab-yield claims still require external wafer data.

science

BEM Impedance Validation

3.57% MAE vs 6 IEEE-Published HFSS-Coaxial Reference Points

Our Boundary Element Method solver was cross-checked against five independent peer-reviewed publications spanning four glass types. Mean Absolute Error: 3.57%. Provenance: the 5 reference Z₀ values are HFSS-coaxial extractions from each paper's published figures/tables (source_type:simulation per measurements.json) — not VNA measurements. ±40-60% published-tolerance bands. Real VNA fab campaign queued.

Paper	Glass	Published Z₀	BEM Z₀	Error
Sukumaran ECTC 2014	Eagle XG	48.0 Ω	51.02 Ω	+6.29%
Watanabe ECTC 2019	AF32	44.0 Ω	43.50 Ω	−1.14%
Shorey JMS 2016	Borosilicate	36.5 Ω	36.51 Ω	+0.03%
Tummala JEP 2020	EN-A1	34.0 Ω	34.32 Ω	+0.95%
Hwang TMTT 2017	Quartz	41.0 Ω	37.13 Ω	−9.44%
Mean Absolute Error			3.57%

verified

Multi-Method Validation

Independent Validation Stack

BEM impedance predictions cross-checked against independent published references (6 IEEE-published HFSS-coaxial extractions, 3.57% MAE — not VNA, see provenance disclosure on this page), commercial-standard inductance extraction (FastHenry2), 3D finite element (Palace 0.16.0 real transient FEM, n=100 multifreq), and a high-band OpenEMS witness at 28 / 77 GHz. Every method is open source and reproducible.

IEEE Literature (HFSS-coaxial refs)

Public

3.57% MAE vs 6 IEEE-published HFSS-coaxial reference points; not VNA — source_type:simulation per measurements.json

5 reference geometries

FastHenry2

LGPL

Golden-standard inductance extraction

120 comparisons (20 geometries × 3 glasses × 3 frequencies)

OpenEMS Named Witness

GPL-3.0

3D FDTD high-band witness (28 / 77 GHz)

Scored at 28 / 77 GHz; HBM4 and UCIe remain deferred challenge regimes

AWS Palace FEM

Apache-2.0

3D finite element transient electromagnetic solver

50/50 sims complete (100% success, 71.8 min, Docker on M4 Pro)

Why this matters: A single validation method tells you one thing. Two independent cross-references converging (IEEE-published HFSS-coaxial extractions at 3.57% MAE Z₀ + FastHenry2 magnetostatic across 180 inductance comparisons) confirm the BEM solver is correct within industry tolerance for the regime they cover. Palace 0.16.0 transient FEM (real cross-physics, n=100 across {28, 77, 110, 200} GHz) has been independently reproduced to completion with raw field and port data preserved. Real VNA measurement on Chipletos-designed coupons is the queued ~$200-500K wet-lab campaign. The bar for production EDA is cross-solver convergence on the converged validators plus preserved raw data.

polyline

Golden-Standard Inductance Extraction

120 FastHenry Cross-Validation Comparisons

FastHenry2 (LGPL, originally developed at MIT) is the industry-standard quasi-magnetostatic inductance extractor. Independent validation of BEM-derived Z₀ against FastHenry L across 20 geometries, 3 glasses, 3 frequencies each.

120

Cross-Validation Comparisons

TGV Geometries

Glass Substrates

31.8-66 Ω

BEM Z₀ Range Covered

Why FastHenry: BEM is a moment method in the quasi-static limit. FastHenry is a piecewise-constant filament integral — a fundamentally different numerical approach that converges to the same physics. Agreement between them rules out numerical artifacts in either.

Built from source on Ubuntu 22.04 with -fcommon flag for GCC 11 compatibility. Subprocess wrapper at genesis/solvers/fasthenry_wrapper.py.

deployed_code

3D Finite Element Validation

AWS Palace FEM Cross-Validation

AWS Palace (Apache-2.0) is a 3D finite element electromagnetic solver using MFEM, PETSc, and SuperLU_DIST. Transient time-domain simulations on TGV coaxial models, 200 time steps per geometry, cross-validating BEM predictions across 50 geometries and 5 glass types.

50/50

FEM Simulations Solved

100%

Success Rate (0 failures)

86 s

Mean Compute per Simulation

71.8 min

Total Compute on M4 Pro

Why Palace: A 3D FEM transient solver is orthogonal to quasi-static BEM. It resolves the full time-domain electromagnetic response, including dispersive effects. AWS maintains it for production semiconductor EM workflows — same tool used for superconducting qubit simulations.

Built from source in Docker (RockyLinux 9, 2.94GB image, Palace v0.13.0). 200-step transient simulations, PCG converging in 12 iterations per step. Coverage: 10 TGV geometries × 5 glass types (EagleXG, AF32, Borofloat33, FusedSilica, EN-A1). Mesh: 788–2,614 elements.

compare

Academic Yield Model Benchmark

Benchmarked Against UCLA YAP+

YAP+ (UCLA NanoCAD Lab, Apache-2.0) is the only other open-source hybrid bonding yield model in existence. Direct comparison across 25 test cases spanning 5 overlay sigmas and 5 pad pitches reveals where each model agrees and where they diverge.

Matched Test Cases

5 × 5

Overlay σ × Pad Pitch Grid

Apache-2.0

YAP+ License

Agreement @ Low σ

Both Models Match

Honest finding: The two models agree closely at realistic overlay values (σ < 0.1 µm), both predicting ~100% yield. They diverge at high overlay / tight pitch regimes, where YAP+ penalizes the overlay-to-pitch ratio more aggressively than the Murphy model in our Bondability Pipeline pipeline. This is a documented limitation we're addressing with the process window surrogate and Bayesian calibration.

Benchmark script: scripts/benchmark_yap_vs_prov9.py. Raw results: benchmarks/yap_vs_prov9_results.json.

model_training

BEM Surrogate Model

R² = 0.9520 Replay, 0.9516 Geometry Challenge

BEM v5 multi-output checkpoint re-scored on a 300,000-row sample from the 15.92M-row parquet corpus. The geometry challenge is diagnostic for an existing row-random checkpoint, not a separately group-trained holdout. The strict-group production cohort adds a four-run aggregate on 1.5M-row training runs, with mean unseen-geometry test R² 0.9751 at4.59% mean MAPE; HBM4 remains at 0.9093 / 8.07%. A larger merged strict release is still the next step. The expansion runs now total 901K separately versioned strict additions across UCIe, HBM4, MMWAVE, EXTREME_TIGHT, WIDE_PITCH, and ULTRA_HIGH_FREQ chunks.

0.9520

Row Replay R-Squared

0.9516

Geometry Challenge R-Squared

1000x

Speedup vs BEM Solver

96K

Unique Geometries Sampled

The BEM Multi-Task checkpoint is decomposed: Z₀ is learned; attenuation, phase velocity, and group delay are formula-derived RF outputs. Expansion chunks are cited separately until a canonical corpus merge.

new_releases

Forward Predictions — Unpublished Glass Types

142,965 BEM Predictions on 3 Unpublished Glasses

BEM impedance predictions for glass compositions with zero published TGV data. These are forward predictions verifiable by VNA measurement but available from no other source.

Corning Iris

Dk = 3.5 · 47,655 rows · 50 S2P files

Low-Dk RF specialty glass. Best 50Ω match: d=80µm, p=300µm, t=300µm.

Schott MEA

Dk = 6.1 · 47,655 rows · 50 S2P files

High-Dk glass for capacitive applications. Best 50Ω match: d=75µm, p=400µm, t=500µm.

Glass Core

Dk = 5.0 · 47,655 rows · 50 S2P files

Intel Foveros Glass candidate. Best 50Ω match: d=75µm, p=350µm, t=500µm.

tune

ILC Controller Benchmark

982/1000 Synthetic Wins Across All Controllers

The Iterative Learning Controller (ILC) with Zernike decomposition was benchmarked against five alternative control strategies in a 1,000-case synthetic Monte Carlo with analytical plant models. Mean gain: 87.83%.Scope: simulation benchmark (analytical response model), not wafer- hardware or FEM-solver-in-the-loop. Hardware validation is future work.

Controller	Wins (of 1000)	Mean Gain	Status
PID Baseline	982	87.83%	ILC wins
LQR Optimal	982	87.83%	ILC wins
MPC Predictive	982	87.83%	ILC wins
Sliding Mode	982	87.83%	ILC wins
Fixed Gain	982	87.83%	ILC wins

Zernike decomposition (n=1..6, 27 polynomial terms) enables wafer-level distortion correction that conventional PID/MPC cannot match. The 18 non-wins are edge-case fields where ILC and the alternative tie within measurement noise.

shield

Isolation Synthesis Engine

Adjoint Gradient Correlation: r = 1.0

The adjoint topology optimizer in the Isolation Synthesis Engine was validated against finite-difference gradients to numerical precision. Adjoint-to-FD correlation r = 1.0 across 5 synthesis families and 10 frequency bands.

r = 1.0

Adjoint Gradient Correlation

Adjoint-to-finite-difference gradient correlation across 10 design cases. Sign agreement 10/10.

Synthesis Families

Via fence, mushroom EBG, fractal EBG, slotted metasurface, and topology-optimized. All synthesize end-to-end to DRC-clean GDSII.

GDSII

DRC-Clean Export

Closed-loop synthesis to KLayout DRC-verified GDSII in a single pipeline. The only tool that designs, not just analyzes.

psychology

FNO Yield Screening Model

Screening-Grade Yield Risk Prediction

The Fourier Neural Operator is a screening layer on top of the physics pipeline. It reliably identifies high-risk vs low-risk regions in a layout, enabling fast design-space exploration before committing to full physics verification.

R² = 0.50

Pixel-Level Accuracy

Measured on 20,000 held-out test samples spanning the full operational parameter range.

R² = 0.63

Image-Max Accuracy

Aggregate accuracy at the image level for identifying the worst-case yield region per layout.

13ms

Inference Latency per Die

CPU inference. Enables full-wafer screening at interactive speeds, feeding high-risk regions into the BEM and contact mechanics pipeline.

Full validation methodology and training data details available in the NDA data room.

speed

Inference Performance

Production Latency: Every Solver Under 100ms

All inference latencies measured on CPU. No GPU required for production workloads. The entire platform runs on standard cloud compute.

Solver	Latency	Platform
FNO Yield Model	13ms / die	CPU
BEM Impedance (in-process)	2.7 ms p50, 3.36 ms p99	Apple M4 Pro
Full API Pipeline (end-to-end)	332 ms p50, 379 ms p99 warm; ~2.4 s cold-start	Modal serverless
ILC Controller	<5ms / step	CPU
Isolation Compiler	2–30s	CPU

infoIsolation Compiler uses iterative adjoint optimization — longer latency is expected and represents full synthesis, not a single inference pass.

precision_manufacturing

KLA Calibration Convergence

10 Wafers to CI<20µm

10,000-campaign Bayesian Design of Experiments proves that 10 wafers is the minimum investment for statistically meaningful correlation length calibration.

Minimum Wafers

For CI<20µm on correlation_length. The practical threshold for production-grade calibration.

60.2%

Hit Rate at 10 Wafers

Percentage of campaigns achieving CI<20µm with only 10 wafers of measurement data.

100%

Hit Rate at 20 Wafers

Every campaign converges with 20 wafers. The cost to reach certainty is known and bounded.

verified_user

Evidence-Backed Portfolio.

Every claim is backed by reproducible benchmarks. Every number on this page is verified against source code. The full evidence package, including reproducibility scripts, is available in the NDA data room.

909+

Drafted Claims

Technology Areas

1970

Tests Collected

Benchmark Evidence Files

Request Full Evidence Package

Raw benchmark data and reproducibility scripts available under NDA.