
Overfitting Red-Flags Checklist

Overfitting is selection bias wearing a backtest. The more configurations you try, the higher the best-looking Sharpe rises by luck alone. This checklist is a list of warning signs to scan for before allocating capital and after any round of optimization. Essential flags are near-certain disqualifiers if unaddressed; recommended flags warrant deeper investigation; nice-to-have flags help you quantify how much of the result is real.
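The claim that the best-looking Sharpe rises by luck alone can be checked with a small simulation. The sketch below is illustrative only: it assumes every configuration has a true Sharpe of exactly zero, uses independent Gaussian daily returns, and the function names (`best_of_k_sharpe`, `mean_best_sharpe`) are hypothetical.

```python
import math
import random


def sharpe(returns):
    """Per-period Sharpe ratio (mean / sample std), not annualized."""
    n = len(returns)
    mean = math.fsum(returns) / n
    var = math.fsum((r - mean) ** 2 for r in returns) / (n - 1)
    return mean / math.sqrt(var)


def best_of_k_sharpe(k, n_obs, rng):
    """Best Sharpe among k strategies whose true Sharpe is zero."""
    return max(
        sharpe([rng.gauss(0.0, 0.01) for _ in range(n_obs)])
        for _ in range(k)
    )


def mean_best_sharpe(k, n_obs=252, n_sims=20, seed=0):
    """Average, over n_sims repetitions, of the best-of-k Sharpe."""
    rng = random.Random(seed)
    draws = [best_of_k_sharpe(k, n_obs, rng) for _ in range(n_sims)]
    return math.fsum(draws) / len(draws)


if __name__ == "__main__":
    for k in (1, 10, 100):
        annualized = mean_best_sharpe(k) * math.sqrt(252)
        print(f"{k:>4} trials: average best annualized Sharpe {annualized:.2f}")
```

None of these strategies has any edge, yet the winner's Sharpe grows with the number of trials. That gap is what the flags below are meant to detect.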

By AI Fin Hub Research · AI Fin Hub Team

Checklist Sections

Work in focused batches instead of one long wall

Section 1

Phase 1: Search-process flags

3 items
Use the Tool: Calculators

Backtest Overfitting Score

Upload a backtest trade log and compute Probability of Backtest Overfitting (PBO), Deflated Sharpe Ratio, and the odds your edge survives live trading.
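For readers who want to see what a PBO estimate involves, here is a minimal sketch of combinatorially symmetric cross-validation (CSCV), the procedure usually associated with PBO. How the tool derives PBO from a trade log is not stated here. The sketch assumes you have one equal-length return series per configuration tried, and `pbo_cscv` and its parameters are hypothetical names.

```python
import itertools
import math
import random


def sharpe(returns):
    """Per-period Sharpe ratio; returns 0.0 for a zero-variance series."""
    n = len(returns)
    mean = math.fsum(returns) / n
    var = math.fsum((r - mean) ** 2 for r in returns) / n
    return mean / math.sqrt(var) if var > 0 else 0.0


def pbo_cscv(returns_by_config, n_blocks=8):
    """Estimate PBO from one return series per configuration tried.

    Time is cut into n_blocks contiguous blocks. Every way of choosing
    half of them forms an in-sample (IS) set; the rest is out-of-sample.
    """
    n_configs = len(returns_by_config)
    block_len = len(returns_by_config[0]) // n_blocks
    blocks = [range(b * block_len, (b + 1) * block_len) for b in range(n_blocks)]
    logits = []
    for is_ids in itertools.combinations(range(n_blocks), n_blocks // 2):
        is_idx = [t for b in is_ids for t in blocks[b]]
        oos_idx = [t for b in range(n_blocks) if b not in is_ids for t in blocks[b]]
        is_sr = [sharpe([r[t] for t in is_idx]) for r in returns_by_config]
        oos_sr = [sharpe([r[t] for t in oos_idx]) for r in returns_by_config]
        best = max(range(n_configs), key=is_sr.__getitem__)
        # OOS rank of the IS winner: 1 = worst, n_configs = best.
        rank = sum(s <= oos_sr[best] for s in oos_sr)
        omega = rank / (n_configs + 1)
        logits.append(math.log(omega / (1 - omega)))
    # PBO: share of splits where the IS winner is at or below the OOS median.
    return sum(x <= 0 for x in logits) / len(logits)


if __name__ == "__main__":
    rng = random.Random(1)
    noise = [[rng.gauss(0, 0.01) for _ in range(240)] for _ in range(10)]
    print("PBO on pure-noise configurations:", pbo_cscv(noise))
```

A PBO near 0.5 or higher means the in-sample winner says little about out-of-sample rank. A genuinely dominant configuration drives the estimate toward zero.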

Use the Tool: Calculators

Deflated Sharpe Ratio Calculator

Bailey & López de Prado's Deflated Sharpe Ratio: corrects the observed Sharpe for selection bias across K trials. Reports the deflated Sharpe and the Probabilistic Sharpe Ratio (PSR), the probability that the true Sharpe exceeds a benchmark.
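The correction can be sketched in a few lines. This is a minimal illustration of the published formulas, not the calculator's actual code. It assumes per-period (non-annualized) Sharpe ratios, non-excess kurtosis (3.0 for a normal distribution), and a user-supplied variance of the Sharpe estimates across trials. The function names are hypothetical.

```python
import math
from statistics import NormalDist

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant
_STD_NORMAL = NormalDist()


def expected_max_sharpe(n_trials, sr_variance):
    """Approximate expected best Sharpe among n_trials zero-skill trials."""
    if n_trials < 2:
        return 0.0  # a single trial carries no selection bias
    z1 = _STD_NORMAL.inv_cdf(1 - 1 / n_trials)
    z2 = _STD_NORMAL.inv_cdf(1 - 1 / (n_trials * math.e))
    return math.sqrt(sr_variance) * ((1 - EULER_GAMMA) * z1 + EULER_GAMMA * z2)


def probabilistic_sharpe(sr, benchmark_sr, n_obs, skew=0.0, kurt=3.0):
    """PSR: probability that the true Sharpe exceeds benchmark_sr."""
    denom = math.sqrt(1 - skew * sr + (kurt - 1) / 4 * sr ** 2)
    return _STD_NORMAL.cdf((sr - benchmark_sr) * math.sqrt(n_obs - 1) / denom)


def deflated_sharpe(sr, n_obs, n_trials, sr_variance, skew=0.0, kurt=3.0):
    """DSR: PSR measured against the expected best-of-n_trials Sharpe."""
    hurdle = expected_max_sharpe(n_trials, sr_variance)
    return probabilistic_sharpe(sr, hurdle, n_obs, skew, kurt)


if __name__ == "__main__":
    # Hypothetical inputs: daily Sharpe 0.1 over 1000 days.
    for k in (1, 10, 100, 1000):
        dsr = deflated_sharpe(0.1, 1000, k, sr_variance=1 / 1000)
        print(f"K={k:>4}: deflated Sharpe probability {dsr:.3f}")
```

The same observed Sharpe earns a lower probability as K grows, which is why the trial count has to be recorded honestly.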


Section 2

Phase 2: Parameter fragility flags

3 items

Section 3

Phase 3: Equity-curve flags

3 items

Section 4

Phase 4: Generalization flags

3 items
Use the Tool: Playgrounds

Walk-Forward Validator

Upload a returns CSV. Supports rolling or expanding in-sample/out-of-sample (IS/OOS) windows and reports per-window Sharpe, walk-forward efficiency, and a concatenated OOS equity curve. Catches regime.
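The windowing logic can be sketched as follows. This is a simplified illustration, not the validator's implementation: it scores a fixed return series rather than re-optimizing parameters in each IS window, and it assumes walk-forward efficiency means the ratio of mean OOS Sharpe to mean IS Sharpe (other definitions exist). The `walk_forward` name and result keys are hypothetical.

```python
import itertools
import math


def sharpe(returns):
    """Per-period Sharpe ratio; returns 0.0 for a zero-variance series."""
    n = len(returns)
    mean = math.fsum(returns) / n
    var = math.fsum((r - mean) ** 2 for r in returns) / n
    return mean / math.sqrt(var) if var > 0 else 0.0


def walk_forward(returns, is_len, oos_len, expanding=False):
    """Split returns into consecutive IS/OOS windows and score each one."""
    windows, oos_returns = [], []
    start = 0
    while start + is_len + oos_len <= len(returns):
        split = start + is_len
        is_slice = returns[0 if expanding else start:split]
        oos_slice = returns[split:split + oos_len]
        windows.append({
            "is_len": len(is_slice),
            "is_sharpe": sharpe(is_slice),
            "oos_sharpe": sharpe(oos_slice),
        })
        oos_returns.extend(oos_slice)
        start += oos_len  # step forward by one OOS window
    if not windows:
        raise ValueError("series too short for one IS/OOS window")
    mean_is = math.fsum(w["is_sharpe"] for w in windows) / len(windows)
    mean_oos = math.fsum(w["oos_sharpe"] for w in windows) / len(windows)
    # Assumed definition: mean OOS Sharpe relative to mean IS Sharpe.
    wfe = mean_oos / mean_is if mean_is > 0 else float("nan")
    # Concatenated OOS equity curve, compounding from 1.0.
    equity = list(itertools.accumulate(
        oos_returns, lambda eq, r: eq * (1 + r), initial=1.0))
    return {"windows": windows, "oos_returns": oos_returns,
            "oos_equity": equity, "wfe": wfe}
```

Calling `walk_forward(returns, is_len=252, oos_len=63)` on a list of daily returns gives roughly yearly IS windows and quarterly OOS windows. Passing `expanding=True` anchors every IS window at the first observation.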


Pro Tips

Small moves that make the checklist easier to finish

The trial count is the single number that turns a raw Sharpe into an honest one. If you cannot state it, you cannot claim the strategy is validated.
Robust edges are plateaus; fragile ones are spikes. When you optimize, look at the whole parameter surface, not just the coordinate of the maximum.
When the deflated Sharpe is marginal, the cure is fewer trials or more data, never a harder search. Searching harder is exactly what manufactured the overfit.
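The plateau-versus-spike tip above can be turned into a quick numeric check. The sketch below assumes you have already computed a Sharpe for every cell of a two-parameter grid. It compares the peak with the average of its immediate neighbors. The function name, the grid layout, and any threshold you apply to the ratio are illustrative assumptions, not an established standard.

```python
import math


def peak_neighborhood_ratio(surface):
    """Compare the best cell of a 2D Sharpe grid with its neighbors.

    surface[i][j] is the Sharpe for the i-th value of one parameter and
    the j-th value of another. A ratio near 1 suggests a plateau; a ratio
    near 0 suggests an isolated spike.
    """
    rows, cols = len(surface), len(surface[0])
    if rows * cols < 2:
        raise ValueError("need at least two grid cells")
    cells = [(r, c) for r in range(rows) for c in range(cols)]
    best_r, best_c = max(cells, key=lambda rc: surface[rc[0]][rc[1]])
    peak = surface[best_r][best_c]
    # Up to eight adjacent cells, clipped at the grid edges.
    neighbors = [
        surface[r][c]
        for r in range(max(0, best_r - 1), min(rows, best_r + 2))
        for c in range(max(0, best_c - 1), min(cols, best_c + 2))
        if (r, c) != (best_r, best_c)
    ]
    neighbor_mean = math.fsum(neighbors) / len(neighbors)
    ratio = neighbor_mean / peak if peak > 0 else float("nan")
    return {"peak_cell": (best_r, best_c), "peak": peak,
            "neighbor_mean": neighbor_mean, "ratio": ratio}


if __name__ == "__main__":
    # Hypothetical grids, for illustration only.
    plateau = [[0.9, 1.0, 0.9], [1.0, 1.1, 1.0], [0.9, 1.0, 0.9]]
    spike = [[0.1, 0.0, 0.1], [0.0, 2.0, 0.1], [0.1, 0.0, 0.0]]
    print("plateau ratio:", round(peak_neighborhood_ratio(plateau)["ratio"], 3))
    print("spike ratio:", round(peak_neighborhood_ratio(spike)["ratio"], 3))
```

A single ratio is a coarse summary. It is a prompt to plot the full surface, not a replacement for looking at it.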

Planning estimates only — not financial, tax, or investment advice.