ℹ️ Skipped - page is already crawled
| Filter | Status | Condition | Details |
|---|---|---|---|
| HTTP status | PASS | download_http_code = 200 | HTTP 200 |
| Age cutoff | PASS | download_stamp > now() - 6 MONTH | 0.1 months ago |
| History drop | PASS | isNull(history_drop_reason) | No drop reason |
| Spam/ban | PASS | fh_dont_index != 1 AND ml_spam_score = 0 | ml_spam_score=0 |
| Canonical | PASS | meta_canonical IS NULL OR = '' OR = src_unparsed | Not set |
| Property | Value |
|---|---|
| URL | https://www.statisticshowto.com/zero-inflated-poisson-distribution/ |
| Last Crawled | 2026-04-08 22:18:34 (2 days ago) |
| First Indexed | 2024-05-14 14:38:22 (1 year ago) |
| HTTP Status Code | 200 |
| Meta Title | Zero-Inflated Poisson distribution - Statistics How To |
| Meta Description | The zero-inflated Poisson distribution (also called the zip distribution) is used to model count data that has an excess of zero counts |
| Meta Canonical | null |
| Boilerpipe Text | <
List of probability distributions
< Zero-inflated Poisson distribution
The
zero-inflated Poisson distribution
(also called the
zip distribution
) is a generalization of the regular Poisson distribution to account for extra zeros. It’s often prefered to the Poisson distribution because some datasets contain numerous zeros [1].
The zip model has a wide range of applications in fields such as business, ecology, economics, and epidemiology. For example, it could be used to model a company’s employee attendance — especially if there seems to be a large amount of absences (i.e., zeros in the dataset). Predictors of the number of days of absence in the current year could include lower scores on yearly appraisals or a history of arriving late to work. The model can also be used to account for overdispersion in count data, by assuming that there are two types of data points: those with a zero count of probability
p
, and those with a nonzero count of probability 1-
p
.
Zip distribution histograms [2].
The
probability mass function (PMF)
of the ZIP distribution is [3]
where ≤ π ≤ 1 and λ ≥ 0.
The λ parameter forces the distribution to inflate the zeros; when λ = 0, the zero-inflated Poisson distribution reduces to the Poisson distribution.
Zero-inflated vs. zero-modified distributions
The main difference between a zero-inflated model and a zero-modified model lies in how they handle excess zeroes in data. Zip models separate excess zeroes into components, while zero-modified models modify the count distribution to account for excess zeroes.Â
In the
zero-inflated distribution
, it is assumed that the data contains two types of zeroes with different generating processes:
Structural zeros: the true absence of events,
Sampling zeroes: a result of chance.
The excess zeroes are modeled separately from non-zero counts using a
mixture distribution.
On the other hand, a zero-modified model assumes that the counts follow a known distribution (such as a Poisson distribution or negative binomial distribution). An additional modification term, which accounts for excess zeroes, could be:
An additional mass at zero (i.e., there is a probability of generating a zero that is greater than what the distribution would normally generate), or
An additional distribution that generates zero counts.
Which model you choose depends on the type of data you’re working with and the research question being addressed.
Zero-inflated Poisson regression
In the last few years, zero-inflated distributions have become popular in regression analysis. This popularity is perhaps due to Lambert’s influential paper [4], which showed that ZIP regression is better than Poisson regression when it comes to fitting data with many zeros. Zero-inflated models most likely originated from the econometrics field [5].
The main difference between a zero-inflated Poisson
distribution
and zero-inflated Poisson
regression
lies in their application. Zero-inflated Poisson distribution is a probability distribution used to model count data with a significant proportion of zero counts. On the other hand, zero-inflated Poisson regression extends the standard Poisson regression to model zero inflation in data. Instead of assuming that the count data follows a standard Poisson, the model assumes that data is generated from a mixture of two processes:
One process that generates zero counts
One process that generates count data from a Poisson distribution.
Zero-inflated Poisson regression allows for estimation of model parameters as well as testing hypotheses about the significance of
predictors
.
References
[1] D. Böhning, “Zero-inflated Poisson models and C.A.MAN: a tutorial collection of
evidence”, Biometric Model 40:7 (1998), 833–843. Zbl 0914.62091
[2] Synergy42, CC BY-SA 4.0
https://creativecommons.org/licenses/by-sa/4.0
, via Wikimedia Commons
[3] Becket, S. et. al. Zero-inflated Poisson (ZIP) distribution: parameter estimation and applications to
model data from natural calamities. Involve Journal of Mathematics. 2014. Vol 7. No. 6.
[4] Lambert, D. (1992) Zero-inflated Poisson regression, with an application to defects in manufacturing.Â
Technometrics
, 34, 1-14.
[5] Riddout, M. Zero-inflated models. Retrieved May 4, 2023 from: https://www.kent.ac.uk/smsas/personal/msr/webfiles/zip/zip.html |
| Markdown | [Skip to content](https://www.statisticshowto.com/zero-inflated-poisson-distribution/#content "Skip to content")
[Statistics How To](https://www.statisticshowto.com/)
Menu
- [Home](https://www.statisticshowto.com/)
- [Tables](https://www.statisticshowto.com/tables/)
- [Binomial Distribution Table](https://www.statisticshowto.com/tables/binomial-distribution-table/)
- [F Table](https://www.statisticshowto.com/tables/f-table/)
- [Inverse T Distribution Table](https://www.statisticshowto.com/tables/inverse-t-distribution-table/)
- [PPMC Critical Values](https://www.statisticshowto.com/tables/ppmc-critical-values/)
- [T-Distribution Table (One Tail and Two-Tails)](https://www.statisticshowto.com/tables/t-distribution-table/)
- [Chi Squared Table (Right Tail)](https://www.statisticshowto.com/tables/chi-squared-table-right-tail/)
- [Z-table (Right of Curve or Left)](https://www.statisticshowto.com/tables/z-table/)
- [Probability and Statistics](https://www.statisticshowto.com/probability-and-statistics/)
- [Binomials](https://www.statisticshowto.com/probability-and-statistics/binomial-theorem/)
- [Chi-Square Statistic](https://www.statisticshowto.com/probability-and-statistics/chi-square/)
- [Expected Value](https://www.statisticshowto.com/probability-and-statistics/expected-value/)
- [Hypothesis Testing](https://www.statisticshowto.com/probability-and-statistics/hypothesis-testing/)
- [Non Normal Distribution](https://www.statisticshowto.com/probability-and-statistics/non-normal-distributions/)
- [Normal Distributions](https://www.statisticshowto.com/probability-and-statistics/normal-distributions/)
- [Probability](https://www.statisticshowto.com/probability-and-statistics/probability-main-index/)
- [Regression Analysis](https://www.statisticshowto.com/probability-and-statistics/regression-analysis/)
- [Statistics Basics](https://www.statisticshowto.com/statistics-basics/)
- [T-Distribution](https://www.statisticshowto.com/probability-and-statistics/t-distribution/)
- [Multivariate Analysis & Independent Component](https://www.statisticshowto.com/probability-and-statistics/multivariate-analysis/)
- [Sampling](https://www.statisticshowto.com/probability-and-statistics/sampling-in-statistics/)
- [Calculators](https://www.statisticshowto.com/calculators/)
- [Variance and Standard Deviation Calculator](https://www.statisticshowto.com/calculators/variance-and-standard-deviation-calculator/)
- [Tdist Calculator](https://www.statisticshowto.com/calculators/tdist-calculator/)
- [Permutation Calculator / Combination Calculator](https://www.statisticshowto.com/calculators/permutation-calculator-and-combination-calculator/)
- [Interquartile Range Calculator](https://www.statisticshowto.com/calculators/interquartile-range-calculator/)
- [Linear Regression Calculator](https://www.statisticshowto.com/calculators/linear-regression-calculator/)
- [Expected Value Calculator](https://www.statisticshowto.com/calculators/expected-value-calculator/)
- [Binomial Distribution Calculator](https://www.statisticshowto.com/calculators/binomial-distribution-calculator/)
- [Matrices](https://www.statisticshowto.com/matrices-and-matrix-algebra/)
- [Experimental Design](https://www.statisticshowto.com/experimental-design/)
- [Calculus Based Statistics](https://www.statisticshowto.com/probability-and-statistics/calculus-based-statistics/)
[Statistics How To](https://www.statisticshowto.com/)
Menu
- [Home](https://www.statisticshowto.com/)
- [Tables](https://www.statisticshowto.com/tables/)
- [Binomial Distribution Table](https://www.statisticshowto.com/tables/binomial-distribution-table/)
- [F Table](https://www.statisticshowto.com/tables/f-table/)
- [Inverse T Distribution Table](https://www.statisticshowto.com/tables/inverse-t-distribution-table/)
- [PPMC Critical Values](https://www.statisticshowto.com/tables/ppmc-critical-values/)
- [T-Distribution Table (One Tail and Two-Tails)](https://www.statisticshowto.com/tables/t-distribution-table/)
- [Chi Squared Table (Right Tail)](https://www.statisticshowto.com/tables/chi-squared-table-right-tail/)
- [Z-table (Right of Curve or Left)](https://www.statisticshowto.com/tables/z-table/)
- [Probability and Statistics](https://www.statisticshowto.com/probability-and-statistics/)
- [Binomials](https://www.statisticshowto.com/probability-and-statistics/binomial-theorem/)
- [Chi-Square Statistic](https://www.statisticshowto.com/probability-and-statistics/chi-square/)
- [Expected Value](https://www.statisticshowto.com/probability-and-statistics/expected-value/)
- [Hypothesis Testing](https://www.statisticshowto.com/probability-and-statistics/hypothesis-testing/)
- [Non Normal Distribution](https://www.statisticshowto.com/probability-and-statistics/non-normal-distributions/)
- [Normal Distributions](https://www.statisticshowto.com/probability-and-statistics/normal-distributions/)
- [Probability](https://www.statisticshowto.com/probability-and-statistics/probability-main-index/)
- [Regression Analysis](https://www.statisticshowto.com/probability-and-statistics/regression-analysis/)
- [Statistics Basics](https://www.statisticshowto.com/statistics-basics/)
- [T-Distribution](https://www.statisticshowto.com/probability-and-statistics/t-distribution/)
- [Multivariate Analysis & Independent Component](https://www.statisticshowto.com/probability-and-statistics/multivariate-analysis/)
- [Sampling](https://www.statisticshowto.com/probability-and-statistics/sampling-in-statistics/)
- [Calculators](https://www.statisticshowto.com/calculators/)
- [Variance and Standard Deviation Calculator](https://www.statisticshowto.com/calculators/variance-and-standard-deviation-calculator/)
- [Tdist Calculator](https://www.statisticshowto.com/calculators/tdist-calculator/)
- [Permutation Calculator / Combination Calculator](https://www.statisticshowto.com/calculators/permutation-calculator-and-combination-calculator/)
- [Interquartile Range Calculator](https://www.statisticshowto.com/calculators/interquartile-range-calculator/)
- [Linear Regression Calculator](https://www.statisticshowto.com/calculators/linear-regression-calculator/)
- [Expected Value Calculator](https://www.statisticshowto.com/calculators/expected-value-calculator/)
- [Binomial Distribution Calculator](https://www.statisticshowto.com/calculators/binomial-distribution-calculator/)
- [Matrices](https://www.statisticshowto.com/matrices-and-matrix-algebra/)
- [Experimental Design](https://www.statisticshowto.com/experimental-design/)
- [Calculus Based Statistics](https://www.statisticshowto.com/probability-and-statistics/calculus-based-statistics/)
# Zero-Inflated Poisson distribution
\< [List of probability distributions](https://www.statisticshowto.com/probability-and-statistics/statistics-definitions/probability-distribution/) \< Zero-inflated Poisson distribution
The **zero-inflated Poisson distribution** (also called the *zip distribution*) is a generalization of the regular Poisson distribution to account for extra zeros. It’s often prefered to the Poisson distribution because some datasets contain numerous zeros \[1\].
The zip model has a wide range of applications in fields such as business, ecology, economics, and epidemiology. For example, it could be used to model a company’s employee attendance — especially if there seems to be a large amount of absences (i.e., zeros in the dataset). Predictors of the number of days of absence in the current year could include lower scores on yearly appraisals or a history of arriving late to work. The model can also be used to account for overdispersion in count data, by assuming that there are two types of data points: those with a zero count of probability *p*, and those with a nonzero count of probability 1-*p*.
## Zero-inflated Poisson distribution PMF

Zip distribution histograms \[2\].
The [probability mass function (PMF)](https://www.statisticshowto.com/probability-mass-function-pmf-definition-examples/) of the ZIP distribution is \[3\]

where ≤ π ≤ 1 and λ ≥ 0.
The λ parameter forces the distribution to inflate the zeros; when λ = 0, the zero-inflated Poisson distribution reduces to the Poisson distribution.
## Zero-inflated vs. zero-modified distributions
The main difference between a zero-inflated model and a zero-modified model lies in how they handle excess zeroes in data. Zip models separate excess zeroes into components, while zero-modified models modify the count distribution to account for excess zeroes.
In the **zero-inflated distribution**, it is assumed that the data contains two types of zeroes with different generating processes:
- Structural zeros: the true absence of events,
- Sampling zeroes: a result of chance.
The excess zeroes are modeled separately from non-zero counts using a [mixture distribution.](https://www.statisticshowto.com/mixture-distribution/)
On the other hand, a zero-modified model assumes that the counts follow a known distribution (such as a Poisson distribution or negative binomial distribution). An additional modification term, which accounts for excess zeroes, could be:
- An additional mass at zero (i.e., there is a probability of generating a zero that is greater than what the distribution would normally generate), or
- An additional distribution that generates zero counts.
Which model you choose depends on the type of data you’re working with and the research question being addressed.
## Zero-inflated Poisson regression
In the last few years, zero-inflated distributions have become popular in regression analysis. This popularity is perhaps due to Lambert’s influential paper \[4\], which showed that ZIP regression is better than Poisson regression when it comes to fitting data with many zeros. Zero-inflated models most likely originated from the econometrics field \[5\].
The main difference between a zero-inflated Poisson *distribution* and zero-inflated Poisson *regression* lies in their application. Zero-inflated Poisson distribution is a probability distribution used to model count data with a significant proportion of zero counts. On the other hand, zero-inflated Poisson regression extends the standard Poisson regression to model zero inflation in data. Instead of assuming that the count data follows a standard Poisson, the model assumes that data is generated from a mixture of two processes:
- One process that generates zero counts
- One process that generates count data from a Poisson distribution.
Zero-inflated Poisson regression allows for estimation of model parameters as well as testing hypotheses about the significance of [predictors](https://www.statisticshowto.com/independent-variable-definition/#Predictor).
#### References
\[1\] D. Böhning, “Zero-inflated Poisson models and C.A.MAN: a tutorial collection of
evidence”, Biometric Model 40:7 (1998), 833–843. Zbl 0914.62091
\[2\] Synergy42, CC BY-SA 4.0 <https://creativecommons.org/licenses/by-sa/4.0>, via Wikimedia Commons
\[3\] Becket, S. et. al. Zero-inflated Poisson (ZIP) distribution: parameter estimation and applications to
model data from natural calamities. Involve Journal of Mathematics. 2014. Vol 7. No. 6.
\[4\] Lambert, D. (1992) Zero-inflated Poisson regression, with an application to defects in manufacturing. *Technometrics*, 34, 1-14.
\[5\] Riddout, M. Zero-inflated models. Retrieved May 4, 2023 from: https://www.kent.ac.uk/smsas/personal/msr/webfiles/zip/zip.html
[Delaporte Distribution](https://www.statisticshowto.com/delaporte-distribution/)
[Tweedie Distribution: Definition and Examples](https://www.statisticshowto.com/tweedie-distribution/)
**Comments? Need to post a correction?** Please [***Contact Us***](https://www.statisticshowto.com/contact/).
### Leave a Comment
You must be [logged in](https://www.statisticshowto.com/sht-dash/?redirect_to=https%3A%2F%2Fwww.statisticshowto.com%2Fzero-inflated-poisson-distribution%2F) to post a comment.
Check out our **["Practically Cheating Statistics Handbook]()**, which gives you hundreds of easy-to-follow answers in a convenient e-book.
[](https://www.statisticshowto.com/?page_id=15743&preview=true)
***
**Readers are calling it suspenseful, haunting, and unforgettable. Check out Stephanie’s debut post-apocalyptic novel — available now on Amazon.**
[](https://amzn.to/3LLzICB)
Looking for elementary [**statistics help**](https://www.statisticshowto.com/help-with-statistics-equations/)? You’ve come to the right place. **Statistics How To** has more than 1,000 articles and **videos** for elementary statistics, probability, AP and advanced statistics topics. **Looking for a specific topic?** Type it into the search box at the top of the page.
Latest articles
- [Hedges’ g: Definition, Formula](https://www.statisticshowto.com/hedges-g/)
- [Kaiser-Meyer-Olkin (KMO) Test for Sampling Adequacy](https://www.statisticshowto.com/kaiser-meyer-olkin/)
- [Benjamini-Hochberg Procedure](https://www.statisticshowto.com/benjamini-hochberg-procedure/)
- [Tetrachoric Correlation: Definition, Examples, Formula](https://www.statisticshowto.com/tetrachoric-correlation/)
- [Random Variable: What is it in Statistics?](https://www.statisticshowto.com/random-variable/)
- [Maximum Likelihood and Maximum Likelihood Estimation](https://www.statisticshowto.com/maximum-likelihood-estimation/)
- [What is a Parameter in Statistics?](https://www.statisticshowto.com/what-is-a-parameter-in-statistics/)
- [Beta Level: Definition & Examples](https://www.statisticshowto.com/beta-level/)
- [Pairwise Independent, Mutually Independent: Definition, Example](https://www.statisticshowto.com/pairwise-independent-mutually/)
- [Population Mean Definition, Example, Formula](https://www.statisticshowto.com/population-mean/)
- [Dispersion / Measures of Dispersion: Definition](https://www.statisticshowto.com/dispersion/)
- [Serial Correlation / Autocorrelation: Definition, Tests](https://www.statisticshowto.com/serial-correlation-autocorrelation/)
© 2026 [**Statistics How To**](https://www.statisticshowto.com/) \| [About Us](https://www.statisticshowto.com/contact/) \| [Privacy Policy](https://www.statisticshowto.com/privacy-policy/) \| [Terms of Use](https://www.statisticshowto.com/terms-of-use/) |
| Readable Markdown | \< [List of probability distributions](https://www.statisticshowto.com/probability-and-statistics/statistics-definitions/probability-distribution/) \< Zero-inflated Poisson distribution
The **zero-inflated Poisson distribution** (also called the *zip distribution*) is a generalization of the regular Poisson distribution to account for extra zeros. It’s often prefered to the Poisson distribution because some datasets contain numerous zeros \[1\].
The zip model has a wide range of applications in fields such as business, ecology, economics, and epidemiology. For example, it could be used to model a company’s employee attendance — especially if there seems to be a large amount of absences (i.e., zeros in the dataset). Predictors of the number of days of absence in the current year could include lower scores on yearly appraisals or a history of arriving late to work. The model can also be used to account for overdispersion in count data, by assuming that there are two types of data points: those with a zero count of probability *p*, and those with a nonzero count of probability 1-*p*.

Zip distribution histograms \[2\].
The [probability mass function (PMF)](https://www.statisticshowto.com/probability-mass-function-pmf-definition-examples/) of the ZIP distribution is \[3\]

where ≤ π ≤ 1 and λ ≥ 0.
The λ parameter forces the distribution to inflate the zeros; when λ = 0, the zero-inflated Poisson distribution reduces to the Poisson distribution.
## Zero-inflated vs. zero-modified distributions
The main difference between a zero-inflated model and a zero-modified model lies in how they handle excess zeroes in data. Zip models separate excess zeroes into components, while zero-modified models modify the count distribution to account for excess zeroes.
In the **zero-inflated distribution**, it is assumed that the data contains two types of zeroes with different generating processes:
- Structural zeros: the true absence of events,
- Sampling zeroes: a result of chance.
The excess zeroes are modeled separately from non-zero counts using a [mixture distribution.](https://www.statisticshowto.com/mixture-distribution/)
On the other hand, a zero-modified model assumes that the counts follow a known distribution (such as a Poisson distribution or negative binomial distribution). An additional modification term, which accounts for excess zeroes, could be:
- An additional mass at zero (i.e., there is a probability of generating a zero that is greater than what the distribution would normally generate), or
- An additional distribution that generates zero counts.
Which model you choose depends on the type of data you’re working with and the research question being addressed.
## Zero-inflated Poisson regression
In the last few years, zero-inflated distributions have become popular in regression analysis. This popularity is perhaps due to Lambert’s influential paper \[4\], which showed that ZIP regression is better than Poisson regression when it comes to fitting data with many zeros. Zero-inflated models most likely originated from the econometrics field \[5\].
The main difference between a zero-inflated Poisson *distribution* and zero-inflated Poisson *regression* lies in their application. Zero-inflated Poisson distribution is a probability distribution used to model count data with a significant proportion of zero counts. On the other hand, zero-inflated Poisson regression extends the standard Poisson regression to model zero inflation in data. Instead of assuming that the count data follows a standard Poisson, the model assumes that data is generated from a mixture of two processes:
- One process that generates zero counts
- One process that generates count data from a Poisson distribution.
Zero-inflated Poisson regression allows for estimation of model parameters as well as testing hypotheses about the significance of [predictors](https://www.statisticshowto.com/independent-variable-definition/#Predictor).
#### References
\[1\] D. Böhning, “Zero-inflated Poisson models and C.A.MAN: a tutorial collection of
evidence”, Biometric Model 40:7 (1998), 833–843. Zbl 0914.62091
\[2\] Synergy42, CC BY-SA 4.0 <https://creativecommons.org/licenses/by-sa/4.0>, via Wikimedia Commons
\[3\] Becket, S. et. al. Zero-inflated Poisson (ZIP) distribution: parameter estimation and applications to
model data from natural calamities. Involve Journal of Mathematics. 2014. Vol 7. No. 6.
\[4\] Lambert, D. (1992) Zero-inflated Poisson regression, with an application to defects in manufacturing. *Technometrics*, 34, 1-14.
\[5\] Riddout, M. Zero-inflated models. Retrieved May 4, 2023 from: https://www.kent.ac.uk/smsas/personal/msr/webfiles/zip/zip.html |
| Shard | 150 (laksa) |
| Root Hash | 12746512124316119150 |
| Unparsed URL | com,statisticshowto!www,/zero-inflated-poisson-distribution/ s443 |