10 Wafers: The Minimum KLA Archer Scans for Yield Calibration

The problem

Hybrid bonding yield in advanced packaging is dominated by one parameter: the correlation length of the surface roughness on the two bonding wafers. If correlation length is short, defects average out. If it is long, defects cluster and yield collapses. The challenge is that you cannot measure correlation length directly; you have to infer it from displacement metrology.

KLA Archer scanners produce dx, dy overlay maps for every wafer they touch. The maps are dense — thousands of measured points per wafer — and they encode the underlying correlation structure. The question is how many wafers worth of map data you need before the inferred correlation_length is precise enough to drive a yield model.

Bayesian Normal-Normal sequential calibration

We use a Normal-Normal conjugate Bayesian update. The prior on correlation_length is broad — N(50 µm, 50 µm) — to avoid biasing the answer. Each new wafer's overlay map produces a likelihood term that narrows the posterior. The standard error of the posterior shrinks as 1/√N, so the credible interval also shrinks as 1/√N. Quantitatively, the 95% CI half-width after N wafers is roughly:

CI_half(N) ≈ 1.96 · σ_obs / √N

For our characterization data, σ_obs is approximately 32 µm. To hit a 20 µm half-width target, you need:

N ≥ (1.96 · 32 / 20)² ≈ 9.84 → 10 wafers

That is the analytic prediction. We tested it empirically.

The 10K-campaign DOE

We ran 10,000 independent calibration campaigns against synthetic ground truth. Each campaign was a fresh wafer-by-wafer Bayesian update. We tracked the 95% CI on correlation_length after every wafer, and we counted how many campaigns hit the <20 µm target at each step.

Wafers Used	% Hit CI < 20 µm	Mean CI Half-Width
5	12.4%	28.1 µm
10	60.2%	19.8 µm
15	92.7%	16.2 µm
20	100.0%	14.0 µm

At 10 wafers, 60% of campaigns lock the parameter. At 20 wafers, every single campaign locks it. The mean CI half-width tracks the analytic prediction within 2%. The complete DOE result is archived at benchmarks/calibration_doe_10k.json.

Why this matters for KLA

KLA Archer scanners are already deployed in every advanced packaging fab on Earth. They already produce the dx, dy maps this calibration consumes. There is no new instrument required, no new metrology recipe, no extra wafer cost. A fab that owns an Archer can plug into Genesis on day one and have a calibrated yield model after the first day's production lot.

That changes the economics of yield modeling from a months-long characterization project to a one-shift calibration. New product ramps that previously needed 100+ wafers of characterization data can now bootstrap with 10. The remaining 90 wafers go into the fab's revenue stream instead of into a DOE.

The bigger picture

Once correlation_length is locked, yield is a deterministic function of layout density and process noise — both of which the fab already measures. The Bayesian calibration unlocks the rest of the physics chain: contact mechanics, void formation, thermal stress, Cu₂O delamination risk, and the Murphy yield model all chain off the same correlation parameter. Lock it once, predict yield forever.

Try the calibration loop in the playground — there is a KLA Archer demo with sample CSVs.