The H6 v8 XGBoost Ecosystem and Three-Tier Cohort System: A Technical Note
Philipp Baro · Hedonix Research, Frankfurt am Main
Abstract
This technical note documents the production cutover, on 15 May 2026, of the H6 model ecosystem from the OLS hedonic specifications of Hedonix Working Paper No. 2026-02 to a coherent XGBoost ensemble. The ensemble is trained on a panel spanning Sun & Moon through the Modern Era: 1,733 cards for the PSA-10 and PSA-9 targets and 2,822 cards for the raw target. The cutover is motivated by an empirical ceiling observed during a four-round refit cycle: on PSA-10, OLS fitted to the broader heterogeneous-rarity panel loses approximately 15 percentage points of leave-one-set-out (LOSO) R² relative to the narrow v3 SV-only panel. A tree-ensemble architecture on the same broad panel recovers 5 percentage points of that gap on PSA-10 and strictly beats the narrow v3 baseline on PSA-9 (LOSO R² 0.812 vs. 0.745; median absolute percentage error 19.4% vs. 24.3%). We document the three production models; their in-sample, 5-fold cross-validation, and LOSO metrics; and the methodological rationale for accepting a stress-test LOSO penalty in exchange for production-realistic coverage. We additionally introduce a three-tier cohort system that assigns each cohort-eligible card three independent quintiles, one per grade tier, and that decouples this analytic surface from the strict 648-card raw-XGB universe backing the live Hedonix Index NAV. This note is an architectural change-log; the live-track and walk-forward analyses on the v8 architecture remain forthcoming.
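For readers unfamiliar with the leave-one-set-out protocol behind the headline figures, the following is a minimal sketch rather than the production evaluation code. It assumes that each row carries a set label, that out-of-fold predictions are pooled across held-out sets before R² and the median absolute percentage error are computed, and that errors are measured on price levels; the note does not confirm any of these choices. The row keys `set`, `x`, and `y`, the helper names, and the one-feature least-squares stand-in for the XGBoost models are all hypothetical.

```python
from statistics import mean, median


def fit_line(train):
    """Stand-in model (assumption): one-feature least-squares line."""
    x_bar = mean(r["x"] for r in train)
    y_bar = mean(r["y"] for r in train)
    sxy = sum((r["x"] - x_bar) * (r["y"] - y_bar) for r in train)
    sxx = sum((r["x"] - x_bar) ** 2 for r in train)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def predict_line(model, row):
    slope, intercept = model
    return slope * row["x"] + intercept


def leave_one_set_out(rows, fit, predict):
    """Hold out each set in turn; return pooled (actual, predicted) pairs."""
    pairs = []
    for held_out in sorted({r["set"] for r in rows}):
        train = [r for r in rows if r["set"] != held_out]
        test = [r for r in rows if r["set"] == held_out]
        model = fit(train)  # the model never sees the held-out set
        pairs.extend((r["y"], predict(model, r)) for r in test)
    return pairs


def r_squared(pairs):
    """Pooled out-of-fold R² (pooling is an assumption of this sketch)."""
    y_bar = mean(y for y, _ in pairs)
    ss_res = sum((y - p) ** 2 for y, p in pairs)
    ss_tot = sum((y - y_bar) ** 2 for y, _ in pairs)
    return 1.0 - ss_res / ss_tot


def median_ape(pairs):
    """Median absolute percentage error, in percent."""
    return 100.0 * median(abs(p - y) / abs(y) for y, p in pairs)


# Toy rows for illustration only; not Hedonix data.
rows = [
    {"set": "A", "x": 1.0, "y": 3.0}, {"set": "A", "x": 2.0, "y": 5.0},
    {"set": "B", "x": 3.0, "y": 7.0}, {"set": "B", "x": 4.0, "y": 9.0},
    {"set": "C", "x": 5.0, "y": 11.0}, {"set": "C", "x": 6.0, "y": 13.0},
]
pairs = leave_one_set_out(rows, fit_line, predict_line)
loso_r2 = r_squared(pairs)
loso_mdape = median_ape(pairs)
```

Because every card in a held-out set is predicted by a model that saw none of that set, LOSO is a harsher test than 5-fold cross-validation, where cards from the same set typically land on both sides of each split. That is the sense in which the abstract calls it a stress test.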