When can I use the binomial distribution?

When all four BINS conditions hold: two outcomes per trial (binary), independent trials, a fixed number of trials n, and the same probability of success p on each trial. If any condition fails — most often independence (sampling without replacement) or fixed n (you stop when something happens) — a different distribution is more appropriate.

How is the binomial different from the geometric distribution?

The binomial counts the number of successes in a fixed number of trials. The geometric counts the number of trials needed to get the first success. The two share a setting (independent Bernoulli trials with constant p) but ask opposite questions: how many successes, or how long until one?

What if my trials are sampled without replacement?

Then independence is violated and the count follows a hypergeometric distribution, not a binomial. The hypergeometric formula uses combinations from the finite population. As a rule of thumb, if the sample is small relative to the population (less than 10%), the binomial is usually a good approximation even with sampling without replacement.

When can I use a normal approximation instead of the exact formula?

When np and n(1 − p) are both at least 10, the binomial is well-approximated by a normal distribution with the same mean np and standard deviation √(np(1 − p)). Apply a continuity correction (add or subtract 0.5 from the boundary) for the best accuracy, especially when n is only moderately large.

Binomial Distribution Walkthrough: Formula, Conditions, Worked Example

The binomial distribution shows up the moment a problem asks for the probability of a certain number of "successes" in a fixed number of independent trials. Coin flips, free throws, defective parts on an assembly line — all binomial, once you check the conditions. This walkthrough is the conditions, the formula, the mean and variance, and a full worked example.

The Four Conditions for a Binomial Setting

Before you reach for the formula, check that the situation actually fits the model. A binomial distribution applies when BINS holds:

B — Binary outcomes. Each trial has exactly two possible results, labeled "success" and "failure."
I — Independent trials. The outcome of any one trial does not affect any other.
N — Fixed number of trials, n. You decide n in advance; it does not depend on the data.
S — Same probability of success, p. The probability of success stays the same on every trial.

If any of these fails, binomial is the wrong model. A common failure is independence: drawing cards without replacement changes p as you go, so the count of Hearts in 5 draws from a 52-card deck is not binomial — it is hypergeometric. With replacement, it is binomial.

The Binomial Formula

If X is the number of successes in n independent trials, each with probability p of success, then X follows a binomial distribution with parameters n and p, written X ~ Binomial(n, p). The probability that X equals a specific value k is

P(X = k) = C(n, k) · p^k · (1 − p)^(n − k)

where C(n, k) = n! / (k!(n − k)!) is "n choose k," the number of ways to arrange k successes among n trials.

The three pieces have clean intuitions. p^k is the probability of one specific arrangement of k successes; (1 − p)^(n − k) is the probability that the remaining n − k trials are all failures; C(n, k) counts how many different orderings give exactly k successes.

A bar chart of a binomial probability mass function peaking around its mean — A coin and a tally sheet of heads and tails alongside a binomial PMF bar chart

The mean and variance of a binomial are short:

E(X) = np
Var(X) = np(1 − p)
σ = √(np(1 − p))

These come from treating X as the sum of n independent Bernoulli(p) trials, each with mean p and variance p(1 − p).

The Worked Example

A free-throw shooter makes 70% of her attempts. She takes 10 shots. The shots are independent, p = 0.7 stays the same, n = 10 is fixed, and each shot is made or missed — all four conditions hold, so X = number of made free throws follows Binomial(10, 0.7).

Mean and variance.

E(X) = np = 10 × 0.7 = 7
Var(X) = np(1 − p) = 10 × 0.7 × 0.3 = 2.1
σ = √2.1 ≈ 1.45

On average she makes 7 out of 10, with most outcomes in the 5–9 range.

P(X = 8): exactly 8 made.

P(X = 8) = C(10, 8) · (0.7)^8 · (0.3)^2 = 45 · 0.05765 · 0.09 ≈ 0.2335

C(10, 8) = 45 (choose which two shots she misses). 0.7^8 ≈ 0.05765 and 0.3^2 = 0.09. So there is about a 23% chance she makes exactly 8 of 10.

P(X ≥ 8): at least 8 made.

For "at least," sum P(X = 8) + P(X = 9) + P(X = 10).

P(X = 9) = C(10, 9) · 0.7^9 · 0.3^1 = 10 · 0.04035 · 0.3 ≈ 0.1211
P(X = 10) = C(10, 10) · 0.7^10 · 0.3^0 = 1 · 0.02825 · 1 ≈ 0.0282

P(X ≥ 8) ≈ 0.2335 + 0.1211 + 0.0282 ≈ 0.3828

So about 38% of the time she makes 8 or more of 10. About 62% of the time she makes 7 or fewer.

P(X ≤ 4): the complement trick.

For "at most 4," computing P(X = 0) + P(X = 1) + P(X = 2) + P(X = 3) + P(X = 4) by hand is tedious. A shorter route uses the complement when the cumulative probability is close to 1 from the upper side: P(X ≤ 4) = 1 − P(X ≥ 5). Both work; pick whichever needs the fewer terms.

When the Binomial Fails (and What to Use Instead)

Two relatives of the binomial cover the situations where one of the BINS conditions breaks:

Hypergeometric distribution — counts of successes in n draws without replacement from a finite population (e.g., five cards from a 52-card deck). p shifts after each draw, so independence fails. The hypergeometric formula uses combinations of the population, not n^k powers.
Negative binomial / geometric — instead of a fixed n, you keep going until a fixed number of successes (or one success, for geometric). Here it is the trial count that is random, not the success count.

If the conditions barely hold — say, p drifts slightly over trials — the binomial is often still a good approximation. If they break badly, switch models or simulate.

The Normal Approximation

For large n, the binomial distribution is approximately normal with the same mean np and standard deviation √(np(1 − p)). A common rule of thumb is that the approximation is reliable when both np ≥ 10 and n(1 − p) ≥ 10.

For the free-throw example, np = 7 and n(1 − p) = 3 — neither comfortably above 10, so the normal approximation is shaky here. With n = 100 and p = 0.7 (np = 70, n(1 − p) = 30) you could safely standardize and use a z-table to find binomial probabilities to a few decimal places. Use the exact formula when n is small and the normal approximation when n is large.

Getting Help

Once you are comfortable with binomial, the natural next step is the Poisson distribution explained, which handles counts of rare events when n is large and p is small. To build the probability rules that the binomial relies on under the hood, probability rules: addition and multiplication covers independence and the multiplication rule.

Conclusion

A binomial distribution counts successes in a fixed number of independent trials with a constant success probability. Check BINS first; if it holds, use P(X = k) = C(n, k) p^k (1 − p)^(n − k), and remember E(X) = np and Var(X) = np(1 − p). The free-throw walkthrough above produced P(exactly 8) ≈ 0.234 and P(at least 8) ≈ 0.383 — both straightforward once the formula is set up. For "at most" or "at least" problems, the complement is often the cleaner path.

Binomial Distribution Walkthrough: When to Use It, the Formula, and a Worked Example

The Four Conditions for a Binomial Setting

The Binomial Formula

The Worked Example

When the Binomial Fails (and What to Use Instead)

The Normal Approximation

Getting Help

Conclusion

Frequently Asked Questions

Clear study guides,
straight to your inbox.

The Four Conditions for a Binomial Setting

The Binomial Formula

The Worked Example

When the Binomial Fails (and What to Use Instead)

The Normal Approximation

Getting Help

Conclusion

Frequently Asked Questions

Keep reading

Poisson Distribution Explained: When It Is the Right Model and How to Use It

Can McGraw-Hill Detect Copy and Paste?

Clear study guides,straight to your inbox.

Clear study guides,
straight to your inbox.