backtest / engine confidence
Recommendation outcomes and calibration
Every daily add, trim, hold, and watch call becomes a trial. The system labels 1M, 3M, 6M, and 12M forward outcomes as they mature, then checks whether expected return, evidence, timing, and sizing actually worked.