Adversarial audit of the recommended bid portfolio. Every optimistic assumption is assumed wrong; each failure mode is quantified in dollars against a $2,500 stake.
Re-derived from 2024-25 prices joined to real finishes, paying exact prices and exact flight payouts (winner 70% / runner-up 30% of the payout pool; 10% of the pot directed to the shootout).
| Strategy (1 team/flight) | 2024 ROI | 2025 ROI |
|---|---|---|
| Buy the favorite (priciest) | -38.2% | +18.0% |
| Buy the cheapest | -1.7% | -8.3% |
| Buy everything (whole pot) | -10.0% | -11.1% |
Best dollar-edge teams, one per flight, ≤$2,500 at est_price.
Fair value = P(win)·0.70·0.90·pot + P(2nd)·0.30·0.90·pot, reproduced from the bid sheet.
| Flt | Team | P(win) | Pot | Fair | Price | Edge | Sandbag |
|---|---|---|---|---|---|---|---|
| 20 | Barnes + Smelcer | 30.0% | $1,200 | $288 | $200 | +88 | |
| 9 | Wood + Estes | 32.2% | $4,000 | $986 | $700 | +286 | yes |
| 12 | Wright + Gadsby | 21.6% | $3,300 | $603 | $500 | +103 | yes |
| 8 | Nodar + Heslep | 20.5% | $4,700 | $814 | $700 | +114 | |
| 15 | Alexander + Lambeth | 17.6% | $2,700 | $418 | $400 | +18 |
Baseline (everything goes right): spend $2,500 → EV return $3,111 → net +611 (+24%). This is the number under test.
EV hit vs. the +611 baseline, each mode in isolation. Bars scaled to the largest hit.
| # | Failure mode | Mechanism | EV hit | Net after | Severity |
|---|---|---|---|---|---|
| 1 | Efficient market | Price = unbiased fair value | -$861 | -$250 | |
| 2 | Pot -25% (fixed price) | Room spends less; our prices already committed | -$778 | -$167 | |
| 3 | Winner's curse / overpay | We win bidding wars - pay our max-bid ceiling | -$600 | -$8 | |
| 4 | Buyback adverse selection | Owners reclaim winners (own 50%), dump losers (own 100%) | -$319 | +$292 | |
| 5 | Sandbag doesn't convert | Flagged teams regress halfway to field average | -$247 | +$363 | |
| 6 | First right of refusal | Strong teams pulled at the $2,000 cap; we get the weaker pool | -$141 | +$470 |
| Pot vs. projection | -30% | -20% | -10% | base | +10% | +30% |
|---|---|---|---|---|---|---|
| Net EV | -$323 | -$12 | +$299 | +$611 | +$922 | +$1,544 |
Our cost is fixed once we bid; the pot is not. A pot 20% light zeros the edge; 30% light is a loss.
64% of the portfolio's entire edge (+$390 of +$611) comes from just 2 sandbagger teams. If the room also bids them up, edge → +$221. If the sandbag doesn't convert in the 2-man net better-ball, edge → +$363.
Precedent: Williford — the headline confirmed sandbagger — finished 5th in flight 7 in 2025 despite a scratch-level game. Individual ball-striking does not reliably win a team match-play points race.
200k-sim Monte Carlo on the model-optimistic portfolio (own 100%, model correct — the best case).
Even assuming the model is right, we lose money 2 years in 5 and lose half the stake 1 year in 5. 5 teams on $2,500 is over-concentrated for a near-zero-edge, high-variance game.
The edge is real only on paper and rests almost entirely on 2 sandbagger teams and a single anomalous backtest year (2025). Play small, or not at all. If the partnership plays anyway:
backtest.py + stresstest.py
+ make_report.py; figures in backtest_results.json / stress_results.json.