Questions tagged [pareto-distribution]

The Pareto distribution is a power-law distribution that was originally developed to describe the distribution of income but has since found application in many other areas.

The Pareto distribution was developed in the late 19th century by Italian economist Vilfredo Pareto to describe the allocation of income and wealth among individuals. The distribution has found application in many other areas, such as computing science, civil engineering, stock markets, physics, astronomy, and insurance.

Its pdf is given by:

$$ f(x|a,b) = \frac{ba^b}{x^{b+1}} $$

for $x \geq a > 0$ and $b>0$, where $a$ is the lower bound on the values that a Pareto-distributed random variable can take, whilst $b$ determines the heaviness of the right tail.

The mean and variance are:

\begin{align} &\text{mean (when }b>1\text{): }& &\frac{ab}{b-1} \\[7pt] &\text{variance (when }b>2\text{): }& &\frac{ba^2}{(b-1)^2(b-2)} \end{align}
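The formulas above can be checked numerically with a short sketch. The function names below (`pareto_pdf`, `pareto_mean`, `pareto_variance`, `pareto_sample`) are hypothetical helpers written for this illustration, not part of any library. The sampler assumes the standard CDF $1-(a/x)^b$ for $x \geq a$ and inverts it, and the parameter values $a=1$, $b=5$ are arbitrary choices made so that both moments are finite.

```python
import math
import random

def pareto_pdf(x, a, b):
    """Density b * a**b / x**(b + 1) for x >= a, and 0 below the lower bound a."""
    if x < a:
        return 0.0
    return b * a**b / x**(b + 1)

def pareto_mean(a, b):
    """Mean a*b/(b-1); infinite when b <= 1."""
    return a * b / (b - 1) if b > 1 else math.inf

def pareto_variance(a, b):
    """Variance b*a**2 / ((b-1)**2 * (b-2)); infinite when b <= 2."""
    return b * a**2 / ((b - 1)**2 * (b - 2)) if b > 2 else math.inf

def pareto_sample(a, b, rng):
    """Inverse-transform draw: solve 1 - (a/x)**b = u for x."""
    u = rng.random()  # uniform on [0, 1), so 1 - u is never zero
    return a / (1.0 - u) ** (1.0 / b)

# Illustrative parameters (an assumption, chosen so mean and variance exist).
rng = random.Random(0)
a, b = 1.0, 5.0
draws = [pareto_sample(a, b, rng) for _ in range(200_000)]
sample_mean = sum(draws) / len(draws)

print("theoretical mean:", pareto_mean(a, b))
print("sample mean:     ", sample_mean)
print("theoretical var: ", pareto_variance(a, b))
```

For small $b$ the sample moments converge slowly or not at all, which is why the closed-form functions return infinity when the corresponding moment does not exist.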

172 questions
27
votes
7 answers

How to calculate Zipf's law coefficient from a set of top frequencies?

I have several query frequencies, and I need to estimate the coefficient of Zipf's law. These are the top frequencies: 26486 12053 5052 3033 2536 2391 1444 1220 1152 1039
Diegolo
  • 289
  • 1
  • 3
  • 4
27
votes
2 answers

How do I fit a set of data to a Pareto distribution in R?

Have, let's say, the following data: 8232302 684531 116857 89724 82267 75988 63871 23718 1696 436 439 248 235 Want a simple way to fit this (and several other datasets) to a Pareto distribution. Ideally it would…
Felix
  • 659
  • 2
  • 6
  • 10
21
votes
1 answer

Is it possible to understand the Pareto/NBD model conceptually?

I am learning to use the BTYD package, which uses the Pareto/NBD model to predict when a customer is expected to be back. However, all literature on this model is full of mathematics and there does not appear to be a simple/conceptual explanation of…
12
votes
3 answers

How to estimate parameters for Zipf truncated distribution from a data sample?

I have a problem with the estimation parameter for Zipf. My situation is the following: I have a sample set (measured from an experiment that generates calls that should follow a Zipf distribution). I have to demonstrate that this generator really…
Maurizio
  • 265
  • 2
  • 9
12
votes
3 answers

Central limit theorem and the Pareto distribution

Can somebody please provide a simple (lay person) explanation of the relationship between Pareto distributions and the Central Limit Theorem (e.g. does it apply? Why/ why not?)? I am trying to understand the following statement: "the Central Limit…
12
votes
1 answer

What distribution results in adding two Pareto distributions

I'm wondering what distribution results in adding two (or more) type-one Pareto distributions of the form $x^{-\alpha}$. Experimentally, it looks like a two-mode power-law, asymptotic to the difference of alphas.
AMG
  • 121
  • 1
  • 3
11
votes
3 answers

How to check if my data fits log normal distribution?

I'd like to check in R if my data fits log-normal or Pareto distributions. How could I do that? Perhaps ks.test could help me do that, but how could I get the $\alpha$ and $k$ parameters for Pareto distribution for my data?
11
votes
1 answer

Preventing Pareto smoothed importance sampling (PSIS-LOO) from failing

I recently started using Pareto smoothed importance sampling leave-one-out cross-validation (PSIS-LOO), described in these papers: Vehtari, A., & Gelman, A. (2015). Pareto smoothed importance sampling. arXiv preprint (link). Vehtari, A., Gelman,…
11
votes
0 answers

Hyperprior Noninformative Beta Binomial Model

I've been working through Gelman's Bayesian Data Analysis 3 text and have been trying to understand one of the hierarchical models revolving around rat tumors (Chapter 5). He uses a binomial model with p assigned a beta distribution. The Beta…
10
votes
1 answer

How to know if my data fits Pareto distribution?

I have a sample which is a vector with 220 numbers. Here is a link to a histogram of my data. I wish to check if my data fits a Pareto distribution, but I don't want to see QQ plots with that distribution; I need an exact answer with…
stjudent
  • 565
  • 3
  • 6
  • 11
8
votes
1 answer

How to derive the $\alpha$ for the Pareto rule

Suppose we have the CDF for the Pareto Distribution given by: $$ P(X \leq x) = 1-\left(\frac{x_m}{x}\right)^\alpha \;\;\;\;\;\;\;\;\;\; x \geq x_m$$ What is the intuitive way to find the alpha for which the 80/20 rule holds?
kolonel
  • 332
  • 3
  • 10
8
votes
2 answers

How to fit a Pareto distribution to an observed CDF?

Background: I've been given a dataframe that contains data for a CDF. The column X contains the 250 $X$ values, and the column P contains $p(X\geq x)$. I paste the dataset below: X <-…
8
votes
1 answer

Required: Method of moments fitting routine for the two-parameter generalized Pareto

I am currently using the evd package which fits a two-parameter GPD by maximum likelihood. Since in small samples the MOM is superior to the ML estimation I'd like to give it a go. However, the POT package - which could do the job - is offline due…
Joz
  • 1,022
  • 7
  • 17
7
votes
1 answer

How should I interpret these strange density and mixing plots when fitting a generalised pareto distribution using MCMC with JAGS?

I'm trying to fit a generalised pareto distribution to a simulated dataset using JAGS and runjags. When doing so, I get very strange density and mixing plots for the mu parameter. The sigma and xi parameters are estimated as expected. How should I…
7
votes
1 answer

Problem with Pareto distribution and R

I am trying to test this property of pareto distribution: Let f(x) be a pareto distribution $$ f(x)=\alpha \frac{x_m^\alpha}{x^{\alpha+1}} $$ so we have the cdf that is $$ CDF(x)=\int_{x_m}^{x}\alpha…
emanuele
  • 2,008
  • 3
  • 21
  • 34