9  Variance, Standard Deviation & the Long Run

Poker is a game of skill played over a fog of luck. Every decision you make has a correct answer in the long run — the play that wins the most chips on average — but on any single hand, the universe is free to ignore your skill entirely. You can get all-in with aces against a single opponent holding deuces, an 81% favorite, and lose. You can lose three times in a row in that exact spot and watch a third of your stack evaporate, and nothing you did was wrong.

This chapter is about that gap: the space between how well you play and how your results actually look. Understanding variance is not optional knowledge for a serious player. It is the difference between a professional who calmly reloads after a brutal session and an amateur who blows up, chases losses, and quits the game convinced it is rigged. The math here is genuinely intuitive once you see it clearly, and seeing it clearly is one of the most emotionally protective things you can do for your poker career.

9.1 What variance actually is

In everyday speech, “variance” just means “swings.” In statistics it has a precise meaning, and the precise meaning is worth holding onto because it makes everything downstream easier.

When you play a hand of poker, the outcome — how many big blinds you win or lose — is a random variable. Over many hands, that random variable has two numbers that describe it:

  • The mean (your expected value, or EV): the average result per hand, or more usefully per 100 hands. This is your winrate. A good small-stakes online cash player might run at, say, 5 bb/100 — five big blinds of profit for every hundred hands dealt.
  • The standard deviation (SD): a measure of how far individual results typically stray from that mean. This is the size of your swings.

Variance is technically the square of the standard deviation, but in practice players use the words loosely and you should care about the standard deviation, because it is in the same units as your winrate (big blinds) and so you can compare them directly.

TipKey idea

Your winrate tells you how much you make. Your standard deviation tells you how much that winrate hides under noise. The whole problem of “the long run” is that in poker the noise is enormous relative to the signal — the SD dwarfs the winrate — so it takes a shocking amount of data before your real skill becomes visible in your results.

9.2 Typical standard deviations: knowing the numbers

These figures come from tracking-software databases across millions of hands and hands of solver-era population study. Treat them as typical ranges, not laws of physics — your personal SD depends on your style (loose-aggressive players swing more), the format, and the player pool.

Cash games (6-max No-Limit Hold’em), per 100 hands:

  • A tight, straightforward player might run around 75–85 bb/100 of standard deviation.
  • A typical solid regular sits around 85–100 bb/100.
  • A loose, aggressive, three-bet-happy player can push 100–120 bb/100 or higher.

So the common rule of thumb — standard deviation of roughly 80–100 bb/100 in 6-max cash — is a good working number. Full-ring (9-handed) games run lower, often 60–75 bb/100, because you see fewer flops and play fewer big pots. Heads-up runs much higher because you play every hand.

Tournaments (MTTs) are a completely different animal. Variance there is far higher and is usually measured in ROI (return on investment) rather than bb/100, because the prize structure is top-heavy. Most of your profit comes from a small number of deep runs and final tables, which by their nature happen rarely. A winning MTT player with a healthy 30% ROI can — routinely — go hundreds of buy-ins without a meaningful score. We will not put a single SD number on MTTs because it is misleading; the practical takeaway is that tournament downswings are measured in months and thousands of games, not sessions. Chapter 41 returns to why this forces dramatically more conservative bankroll rules for tournament players.

WarningCommon mistake

“I’m a losing player — I haven’t won in three weeks.” Three weeks is not a sample; it is a coin flip. Many genuinely winning players have month-long losing stretches, and many genuinely losing players have month-long heaters that convince them they’ve “figured it out.” You cannot diagnose your winrate from how the last few sessions felt. You need data and you need a lot of it.

9.3 Why a winrate needs a gigantic sample

Here is the single most important quantitative fact in this chapter, and we will build it up so it feels obvious rather than mysterious.

Suppose you are a real, true 5 bb/100 winner with a standard deviation of 100 bb/100. After playing N hundred-hand blocks, two things happen:

  1. Your expected total profit grows in proportion to N. Over N blocks you expect to win 5 × N big blinds.
  2. Your uncertainty — the standard deviation of your total result — grows in proportion to the square root of N. It is 100 × √N.

That asymmetry is the entire story. Your edge accumulates linearly; your luck accumulates only as a square root. Early on, the square root term is monstrously bigger than the linear term, and your results are basically noise. Only as N gets large does the linear signal finally outrun the square-root noise.

Let us make it concrete by looking at the standard deviation of your measured winrate (your profit divided by the number of blocks). That uncertainty shrinks as:

\[\text{SD of measured winrate} = \frac{100}{\sqrt{N}} \text{ bb/100}\]

where N is the number of 100-hand blocks. Plug in some sample sizes:

Hands played Blocks (N) SD of your measured winrate
10,000 100 10.0 bb/100
100,000 1,000 3.2 bb/100
500,000 5,000 1.4 bb/100
1,000,000 10,000 1.0 bb/100

Now read that table with the eyes of our true 5 bb/100 winner.

At 10,000 hands the noise around your measured winrate is ±10 bb/100 (one SD). Your true winrate is 5. So after ten thousand hands — which can be a couple of weeks of serious online grinding — your observed winrate could very plausibly be anywhere from −5 to +15 bb/100. You genuinely cannot tell whether you are a solid winner or an actual loser. The data simply does not contain the answer yet.

At 100,000 hands the noise is ±3.2 bb/100. Now your true 5 produces an observed result usually somewhere around 2 to 8. You are probably a winner, but you still can’t pin the number down well.

At 1,000,000 hands the noise is ±1.0 bb/100. Finally your 5 bb/100 edge is clearly visible above the noise. This is why serious players talk about a million hands as the neighborhood where a cash winrate becomes trustworthy.

TipKey idea

A useful slogan: edge is linear, luck is square-root. Doubling your sample does not halve your uncertainty — you have to quadruple the sample to halve it. That is why confirming a winrate is so brutally data-hungry, and why your gut feeling about “how I’m running” is almost always premature.

9.4 Confidence intervals: putting error bars on yourself

A confidence interval turns the table above into an honest statement about what you know. The standard (roughly 95%) interval is your observed winrate plus or minus about two standard deviations of the measurement.

Worked example. You have played 50,000 hands of 6-max cash. Your tracker says you are winning at 6 bb/100. Your SD is 100 bb/100. Should you be confident you’re a 6 bb/100 crusher?

Step 1 — count the blocks. 50,000 hands = 500 blocks of 100. So N = 500.

Step 2 — SD of the measured winrate: 100 / √500 = 100 / 22.4 ≈ 4.5 bb/100.

Step 3 — 95% confidence interval: 6 ± 2 × 4.5 = 6 ± 9.0, i.e. roughly −3 to +15 bb/100.

Read that out loud. After fifty thousand hands of winning poker, the honest statistical statement is “I am 95% confident my true winrate is somewhere between losing 3 bb/100 and winning 15 bb/100.” The interval still straddles zero. You cannot yet rule out that you are a small loser running hot. That is not pessimism; it is just the arithmetic of a high-variance game.

This is humbling, and it should change how you behave. It means:

  • You should not radically overhaul a winning strategy because of a bad week, nor crown yourself a genius after a good month.
  • You should treat your tracker’s winrate as a moving estimate with error bars, not a verdict.
  • You should keep studying regardless of short-term results, because the results are too noisy to be your teacher on their own.
NoteDrill

Open your own database (or imagine plausible numbers). Take your current sample size in hands, divide by 100 to get N, and compute 100/√N. Multiply by 2. That is your 95% error bar in bb/100. Write down the sentence: “My true winrate is probably between [observed − error] and [observed + error].” Sit with how wide that range is. Most players are stunned the first time they do this honestly.

9.5 Downswings: how deep and how long

A downswing is just a stretch where the square-root noise happens to run against you. Because the noise is so large, downswings that feel catastrophic are completely routine.

A rough way to picture the depth: over a long sample, the size of a “normal” worst-case dip from a peak scales with your SD and with the square root of how many hands you play, while your winrate slowly fills the hole back in. For our 100 bb/100, 5 bb/100 winner, buy-in-sized downswings (say, 20+ buy-ins, where one buy-in is 100bb) are not rare events — they are expected to happen multiple times across a career. A break-even-or-slightly-winning player can be stuck in a downswing that lasts hundreds of thousands of hands, simply because their thin edge takes forever to climb out of a hole the variance dug.

Two consequences follow, and they are the practical heart of this chapter:

  1. The depth of a downswing you should plan for is driven by your SD, not by how well you play. A great player with high variance can have deeper dollar swings than a mediocre player with low variance. Skill reduces the frequency and eventual recovery of downswings; it does not exempt you from them.

  2. You will spend a large fraction of your playing life below your all-time peak. This is mathematically guaranteed for everyone, winners included. The graph in your head where good players’ bankrolls march smoothly upward is a fantasy. Real winning graphs are jagged staircases with long, ugly plateaus and sickening drops, trending up only when you zoom all the way out.

9.6 All-in EV: separating decisions from outcomes

Modern tracking software (Hold’em Manager, PokerTracker, and similar) shows two lines on your results graph: your actual winnings (the green line, your real money) and your all-in EV (sometimes the blue or red line). The all-in adjusted line answers a specific, useful question: what would my results look like if every all-in had resolved exactly according to its probability?

Here is how it works. You get all-in on the turn with a flush draw and a pair, holding 45% equity for a 200bb pot. In reality you either win the whole 200 or win nothing. But the EV line credits you with your fair share: 45% of 200 = 90bb, every single time, regardless of what the river actually brings. Do this across thousands of all-ins and the EV line smooths out the river luck.

  • If your green (actual) line is below your EV line, you have been running below expectation on all-ins — “running bad,” losing flips and getting drawn out on more than your share. The gap is, to a first approximation, luck, not bad play.
  • If your green line is above your EV line, you’ve been running good — winning more all-ins than your equity deserved.
WarningCommon mistake

Two errors cluster around the EV line, and they are opposite mistakes.

Error one: treating the EV line as your “real” winrate. It isn’t. The EV line only removes variance from the moment of the all-in — it assumes your all-in equity was correct, and it does nothing about the variance in whether you got it in good in the first place, or the variance in non-all-in pots. A player who repeatedly stacks off as a 30% underdog can have a beautiful EV line and still be torching money; the EV line happily credits them their 30% share of pots they should never have entered.

Error two: using “I’m running below EV” as a permanent excuse. Over a big enough sample the green and EV lines converge. If after 300,000 hands your green line is still dramatically below your EV line, the more likely explanation is no longer bad luck — it may be that your equity is systematically worse than you think because you’re getting it in against ranges that crush you, or your reads at the moment of stacking off are off. Variance hides skill problems; it does not erase them.

The healthy use of the EV line is as a tilt-control instrument. After a brutal session, before you conclude you played badly, check whether the green line cratered while the EV line held steady. If so, you ran bad — your decisions were probably fine and the river was unkind. That knowledge won’t refund the money, but it can stop you from “fixing” a strategy that was never broken.

9.7 A fully worked hand: right play, wrong result

Let us walk one all-in through both lenses, decision and outcome, so the distinction is concrete.

Setup. 100bb cash, 6-max. You are in the cutoff with A♠ K♠. UTG (a tight, solid regular) opens to 3bb. It folds to you; you 3-bet to 10bb. UTG 4-bets to 24bb. You shove 100bb. UTG snap-calls and tables Q♥ Q♦.

The board runs out J♣ 8♦ 3♠ 9♥ 2♣. You miss everything. You lose 100bb.

Decision analysis. Against a tight UTG’s 4-betting range — typically QQ+, AK, and a few bluffs at most pools — A K is doing fine. Against this specific range (call it QQ, KK, AA, AK, with a bluff or two) your AKs has somewhere in the neighborhood of 40–45% equity. Getting 100bb in when you have ~42% equity and you’re closing the action means you’re putting in your stack to win a pot where your fair share exceeds your investment once you account for fold equity, the dead money already in the pot, and the chance UTG is 4-betting lighter than the nightmare range. The shove is standard and correct.

Outcome analysis. You held roughly 42% equity. The EV line credits you 0.42 × (final pot) regardless of the board. Your actual result was zero. You ran about half a stack below expectation on this single pot. Multiply spots like this over a session and you get the gap between a green line and an EV line.

The lesson. If you replay this hand and feel sick — if you start thinking “I should have just called the 4-bet” or “I should fold AK to tight regs” — you are letting a single high-variance outcome rewrite a correct strategy. The fix is to internalize that you can lose a hand you played perfectly, lose it again the next orbit, and still be the favorite to profit from that exact spot over a career. The board J-8-3-9-2 owes you nothing this hand; the math owes you over ten thousand hands.

NoteDrill

For your next five losing all-ins, write two columns: “Was the get-in correct? (equity + range read)” and “What was the result?” Force yourself to grade the decision without looking at the outcome. The goal is to build the mental habit of evaluating choices independently of results — the single most important psychological skill variance demands. (We go much deeper on this in the tilt and mental-game chapters.)

9.8 The emotional cost — and why it demands bankroll discipline

Everything above is math, but the reason the math matters is human. Variance is not just a statistical property of poker; it is a sustained assault on your emotional stability, and it is engineered — accidentally — to break exactly the cognitive machinery you need to play well.

Here is the trap. Humans are wired to learn from outcomes: do a thing, get a reward, repeat; get punished, stop. That instinct is brilliant for most of life and poison for poker, because poker deliberately decouples the reward from the decision in the short run. If you let outcomes train you, variance will teach you to fear correct plays that recently lost and to love reckless plays that recently won. You will, in effect, be reprogrammed by noise into a worse player. The defense is to anchor your sense of “did I play well?” to your decisions and your study, and explicitly not to the session’s result.

But emotional discipline alone is not enough, because even a perfectly stoic player can go broke through bad luck if they are underfunded. This is the bridge to bankroll management (the full treatment is Chapter 41). The logic is simple and ironclad:

  • Downswings of many buy-ins are not tail risks; they are expected events for everyone.
  • If your entire bankroll is only a handful of buy-ins, an ordinary, statistically routine downswing can bust you before your edge ever gets the sample size it needs to show up.
  • Therefore the size of your bankroll is not really about how good you are — it is about how big your standard deviation is. High-variance formats and styles demand fatter bankrolls. Cash games, with their ~80–100 bb/100 SD, are commonly played on something like 20–40 buy-ins; tournaments, with their wildly higher variance, demand hundreds of buy-ins. Those numbers are downstream of the variance math in this chapter.
TipKey idea

Bankroll management is variance insurance. You are not buying it because you doubt your skill — you are buying it precisely so that your skill gets to play out over a long enough sample to matter. An undercapitalized genius and a well-capitalized solid winner are not in the same business: the first is gambling on the short run, the second is investing in the long run.

9.9 The mindset that survives the long run

Pulling it together, the winning relationship with variance rests on a few load-bearing beliefs. Hold these and the swings become tolerable; lose them and even a real edge won’t save you.

  1. Results in the short run are evidence about luck, not skill. Your skill is measured by the quality of your decisions, which you can evaluate hand by hand. Your luck is measured by the gap between your green line and your EV line, and by your distance from your error-barred true winrate.

  2. The long run is longer than you think. Tens of thousands of hands is a teaser, not a sample. Hundreds of thousands begins to mean something. A million is where a cash winrate stands up. For tournaments, multiply all of that.

  3. Downswings are weather, not climate. They are guaranteed, they are deep, they are no one’s fault, and they end. Your job during one is to keep making good decisions, keep your stakes appropriate to your shrinking bankroll, and refuse to let the noise rewrite your strategy.

  4. Detachment from outcomes is a trainable skill, and it is the master skill of the high-variance game. Everything in the psychology chapters — tilt control, focus, confidence — ultimately reduces to this: can you keep playing your best when the cards are actively lying to you about how well you’re playing?

Get the math into your bones, fund yourself properly, judge your decisions and not your dollars, and the variance that destroys most players becomes the very thing that pays you — because it keeps the weaker players, the ones who can’t see past the short run, coming back to the table again and again.