Interesting experiment arbitraging favorite-longshot bias on polymarket/kalshi
I developed an application for automating and managing trades on polymarket and kalshi. About 6 months ago I started looking into whether the favorite-longshot bias from horse racing and sports betting shows up in prediction markets. It does. I pulled all 59,000 resolved binary markets off their API and ran a calibration study. Markets priced in the 40-50% range resolve Yes only about 22% of the time. Sports and games are the strongest categories.
Why I don't think it's noise
I ran Benjamini-Hochberg FDR correction at q=0.05 across 537 calibration cells. 78 survived. You'd expect about 27 by chance, so that's roughly 2.9x the false discovery rate.
Before writing any strategy code I set hard kill gates. If I couldn't find at least 8pp of miscalibration in a tradeable category, I'd stop. It passed. I also caught a Simpson's paradox artifact where the bias looked like it was growing over time but it was really just Sports becoming a bigger share of the market mix.
The part I think matters most: I tried expanding to Kalshi using 7.68 million markets and the kill gate failed. The apparent signal was a bucket-assignment artifact at the 50-cent line. I killed that track. The gates aren't decoration.
Backtest (in-sample, take it for what it's worth)
About 4,850 signals, ~150 trades after filtering. 64.6% win rate, Sharpe 1.21. Simulation from $3K to ~$8K, max drawdown 25.1%. Average hold around 20 days.
Where I'm at now
Paper trading went live this week. 12 positions open, $4K of $10K deployed. First resolutions in a week or two. I'll post an update with out-of-sample numbers when I have them, pass or fail.
I'm not sharing the specific cell map, classification system, or pipeline logic. That's the IP. But if you pulled the data yourself you'd confirm the FLB exists. Knowing it exists and knowing exactly where to trade it are different things.
What I'm curious about
- Does the methodology hold up or am I missing something obvious?
- Anyone ever commercialized capacity-constrained trading IP? The ceiling here is ~$50-100K before you move the markets you're trading. I'm deploying my own capital but I've been wondering if the methodology has standalone value as education or research.
- If you trade prediction markets, does the FLB claim match your experience?
Happy to talk process and stats. Not sharing the cell map or pipeline specifics.