14: Exponential Distribution

Author

Derek Sollberger

Published

February 22, 2023

Inequalities

Discrete Probability Distributions

The inclusive versus exclusive variation in inequalities matter in discrete probability distributions. With a random variable X defined over a support of k={0,1,2,3,...} number of trials,

P(X<k)=i=0k1P(i),P(Xk)=i=0kP(i)

P(X<32)=P(k=0)+P(k=1)++P(k=31)

P(X32)=P(k=0)+P(k=1)++P(k=31)+P(k=32)

Continuous Probability Distributions

The inclusive versus exclusive variation in inequalities do not matter with continuous probability distributions.

Claim: For a random variable X with probability density function f, P(X<b)=P(Xb)

Motivation

There are many situations that are better modeled with an open set [a,) of support, so we can look at the mother function

f(x)=ex,a<x

which has a horizontal asymptote. Next, a rate parameter λ (“lambda”) gives flexibility in models (i.e. the number we plug in for λ depends on the word problem).

f(x)=eλx,a<x

Normalization

Without much loss of generality, we can shift to a support of [0,). Our next goal is to rescale the function values so that the area under the curve is equal to 100 percent.

  • Find the value of k so that f is a probability density function.

Probability Density Function

Probability Density Function

Having found the value of the scalar k, our probability density function (PDF) is

PDF: f(x)={λeλx,x>00,otherwise

Cumulative Distribution Function

Cumulative Distribution Function

To continue to think of probabilities as area under a curve, we derive the cumulative distribution function (CDF) as the integral of the probability density function

F(x)=xf(t)dt=0xλeλtdt=1eλx,x>0

That is, the CDF of an exponential distribution is

F(x)={1eλx,x>00,x<0

 

The properties of probability include

  • We start with zero probability limxF(x)=0
  • We end with all probability limxF(x)=1

Conventions

Conventions

The models are related with λ=1β. Here in Math 32, we will use the model with the rate parameter λ (lambda) to match the convention used by the R programming language

Sample Statistics

Mean

Recall that the mean and the expected value are synonymous.

μ=E[X]=xf(x)dx=λ0xeλxdx=1λ

We have shown that the expected value for XExp(λ) is E[X]=1λ.

Variance

Further analogues to the formulas used for discrete probability distributions include the the second moment

E[X2]=x2f(x)dx=λ0x2eλxdx=2λ2

and it follows that the variance for an exponential distribution with rate parameter λ is

σ2=Var(X)=E[X2](E[X])2=2λ2(1λ)2=1λ2

As usual, the standard deviation is the square root of the variance.

σ=1λ2=1λ

Median

Back in discrete data, if we had an ordered list of data, {x(1),x(2),x(3),...} we said that the median is the value in the ``middle’’ of the list.

Query: How do you think we define a median here in the setting of continuous distributions?

Median

Back in discrete data, if we had an ordered list of data, {x(1),x(2),x(3),...} we said that the median is the value in the ``middle’’ of the list.

Waiting Times

Let us now return to the notion of waiting times. Suppose that a friend of yours is going to pick you up for a carpool, and you estimate that he tends to arrive with a mean time of 30 minutes. Assume an exponential distribution.

Careful!

Since μ=1λ30=1λλ=130

Compute the probability that your friend will arrive in less than 25 minutes.

Compute the probability that your friend will take more than 40 minutes to arrive.

Memoryless Property

You inherit an exquisite cabin in the woods, but on one condition: you must stay in the cabin overnight on the witch’s sabbath—Halloween. The cabin is notorious for housing the ghost of Cal Kulas, and he strikes sometime after the stroke of midnight with a mean time of 60 minutes.

  • Given that you have already waited 32 minutes to see the ghost, what is the probability that you will have to wait at least another 10 minutes?
  • Given that you have already waited 181 minutes to see the ghost, what is the probability that you will have to wait at least another 10 minutes?

Claim: An exponential distribution has the memoryless property P(T>a+b|T>b)=P(T>a)

Proof:

P(T>a+b|T>b)=P(T>a+b and T>b)P(T>b) =P(T>a+b)P(T>b) =eλ(a+b)eλb =eλa =P(T>a)

Claim: The exponential distribution is the only continuous distribution with the memoryless property.

Proof: (See Math 181)

Looking Ahead

  • due soon
    • WHW6
    • JHW4
    • Mid-Semester Survey
  • Be mindful of before-lecture quizzes

Exam 1 will be on Wed., Mar. 1

  • more information in weekly announcements

No lecture session for Math 32:

  • Mar 10, Mar 24

art by @38mo1