26: Likelihood

Author

Derek Sollberger

Published

April 7, 2023

Notation

Recall,

  • Lower-case \(\{x_{1}, x_{2}, x_{3}, ..., x_{n}\}\) is a set of observations
  • Upper-case \(\{X_{1}, X_{2}, X_{3}, ..., X_{n}\}\) is a set of random variables (i.e. the data set, viewed before it is observed)
  • Treating \(\{X_{1}, X_{2}, ..., X_{n}\}\) as a set of \(n\) i.i.d. (independent and identically distributed) random variables is a common assumption.
  • With independence, \[P(X_{1}, X_{2}, ..., X_{n}) = P(X_{1}) \cdot P(X_{2}) \cdot ... \cdot P(X_{n})\]
  • Each individual probability is computed (at least theoretically) with a PDF (probability density function) \[P(x_{i}) = f_{X}(x_{i})\] (a quick numerical check of this product rule follows this list)
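A quick numerical check of the product rule, assuming (for illustration only) that the observations come from a standard normal distribution and using SciPy's `norm`:

```python
# Joint density of i.i.d. observations as a product of individual PDFs;
# the standard normal choice and the data values are illustrative assumptions.
import numpy as np
from scipy.stats import norm

x = np.array([0.5, -1.2, 0.3])      # a small set of observations
joint = np.prod(norm.pdf(x))        # P(x1) * P(x2) * P(x3) under independence
print(joint)
```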

Inverse

Suppose that we have a sample of data \(\{x_{1}, x_{2}, x_{3}, ..., x_{n}\}\). We want to model these data with a probability distribution, but first we need to figure out the distribution’s parameters. Let us think about this in a Bayesian way:

\[{\color{purple}{P(\text{model} | \text{data})}} = \displaystyle\frac{ {\color{blue}{P(\text{data} | \text{model})} \cdot P(\text{model})} }{ {\color{red}{P(\text{data})}} }\]

  • \({\color{purple}{P(\text{model} | \text{data})}}\) is the posterior probability that we want
  • \({\color{blue}{P(\text{data} | \text{model})}}\) is the likelihood
  • \(P(\text{model})\) is the prior probability
  • Since the probability of the data \({\color{red}{P(\text{data})}}\) (the evidence) is a constant …

… we say that the posterior probability is proportional to the likelihood times the prior. With a flat prior, the posterior is proportional to the likelihood alone.
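As a toy numerical sketch of this proportionality, suppose (hypothetically) three candidate models with made-up likelihood values; dividing by \(P(\text{data})\) only rescales, so under a flat prior the posterior ranking matches the likelihood ranking:

```python
# Toy sketch of "posterior ∝ likelihood × prior" over three candidate models;
# the likelihood values below are made up for illustration.
import numpy as np

prior = np.array([1/3, 1/3, 1/3])            # flat prior over the models
likelihood = np.array([0.02, 0.10, 0.05])    # hypothetical P(data | model)
unnormalized = likelihood * prior            # numerator of Bayes' rule
posterior = unnormalized / unnormalized.sum()  # dividing by P(data) rescales
print(posterior)                             # same ranking as the likelihoods
```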

Likelihood

Likelihood Function

Let the likelihood function, in terms of a parameter \(\theta\), be the joint probability

\[L(\theta) = P(X_{1} = x_{1}, X_{2} = x_{2}, ..., X_{n} = x_{n}) = f_{X}(x_{1}; \theta) \cdot f_{X}(x_{2}; \theta) \cdots f_{X}(x_{n}; \theta)\]

or

\[L\left(\theta; \left\{x_{i}\right\}_{i=1}^{n}\right) = \displaystyle\prod_{i = 1}^{n} f_{X}(x_{i}; \theta)\]

Suppose that we have data on how long a certain type and brand of light bulb operated (under the same working conditions). The observed lifetimes, in months, were

\[6, \quad 18, \quad 29, \quad 44, \quad 48\]

Goal: characterize the top 5 percent of light bulbs.

  • Build the likelihood function assuming an exponential distribution (here \(\mu = 1/\lambda\) denotes the distribution’s mean).
  • Compute the likelihood at \(\mu = 25\).
  • Compute the likelihood at \(\mu = 50\) (a worked sketch follows this list).
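For the exponential density \(f_{X}(x; \lambda) = \lambda e^{-\lambda x}\), the likelihood of the sample is

\[L(\lambda) = \displaystyle\prod_{i=1}^{n} \lambda e^{-\lambda x_{i}} = \lambda^{n}e^{-\lambda\sum x_{i}}\]

A minimal Python sketch that evaluates this for the bulb data, assuming (per the exercise’s notation) that \(\mu = 1/\lambda\) is the mean; the helper name `likelihood` is just for illustration:

```python
# Exponential likelihood L(lambda) = lambda^n * exp(-lambda * sum(x))
# for the bulb data; assumes mu = 1/lambda (the exponential mean).
import numpy as np

x = np.array([6, 18, 29, 44, 48])   # lifetimes in months

def likelihood(mu, x):
    lam = 1 / mu                    # rate parameter lambda = 1/mu
    return lam**len(x) * np.exp(-lam * x.sum())

print(likelihood(25, x))            # approx 3.10e-10
print(likelihood(50, x))            # approx 1.76e-10
```

The tiny magnitudes are typical of likelihoods even for modest samples, which is one motivation for working with the log-likelihood below.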

Log Likelihood

Logarithms

You know that logarithms make large numbers smaller. More precisely, \[\ln(x) < x, \quad x > 1\]

Example: \(\ln(1234) \approx 7.1180\)

Did you know that logarithms make small numbers larger (in size)? More precisely, \[|\ln(x)| > x, \quad 0 < x < \Omega \approx 0.5671,\] where \(\Omega\) is the solution of \(x = e^{-x}\).

Example: \(|\ln(0.1234)| \approx 2.0923\)

From pre-calculus, recall the properties of logarithms: \[\ln(AB) = \ln(A) + \ln(B), \quad \ln\left(\displaystyle\frac{A}{B}\right) = \ln A - \ln B, \quad \ln(A^{c}) = c\ln A\]
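A quick numerical check of these properties, with arbitrary values chosen only for illustration:

```python
# Verifying the product, quotient, and power rules for logarithms numerically.
import math

A, B, c = 3.0, 7.0, 4.0
print(math.log(A * B), math.log(A) + math.log(B))   # product rule
print(math.log(A / B), math.log(A) - math.log(B))   # quotient rule
print(math.log(A ** c), c * math.log(A))            # power rule
```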

For modeling with the exponential distribution, we saw that the likelihood function was

\[L\left(\lambda; \{x_{i}\}_{i=1}^{n}\right) = \displaystyle\prod_{i=1}^{n} f_{X}(x_{i}; \lambda) = \lambda^{n}e^{-\lambda\sum x_{i}}\]

We take the natural logarithm (applying the product and power rules above) to compute the log-likelihood function

\[\ell\left(\lambda; \{x_{i}\}_{i=1}^{n}\right) = \ln L\left(\lambda; \{x_{i}\}_{i=1}^{n}\right) = n\ln\lambda - \lambda\displaystyle\sum_{i=1}^{n} x_{i}\]

  • Compute the log-likelihood at \(\mu = 25\).
  • Compute the log-likelihood at \(\mu = 50\) (a numerical sketch follows this list).
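A minimal sketch evaluating \(\ell\) for the bulb data, again assuming \(\lambda = 1/\mu\); the helper name `log_likelihood` is just for illustration:

```python
# Log-likelihood l(lambda) = n*ln(lambda) - lambda*sum(x) for the bulb data;
# assumes mu = 1/lambda, as in the likelihood sketch above.
import numpy as np

x = np.array([6, 18, 29, 44, 48])   # lifetimes in months

def log_likelihood(mu, x):
    lam = 1 / mu                    # rate parameter lambda = 1/mu
    return len(x) * np.log(lam) - lam * x.sum()

print(log_likelihood(25, x))        # approx -21.89
print(log_likelihood(50, x))        # approx -22.46
```

These agree with the logarithms of the likelihood values computed earlier (e.g. \(\ln(3.10 \times 10^{-10}) \approx -21.89\)), and the larger value at \(\mu = 25\) again favors that candidate.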

Visuals

  • simulation
  • a better simulation

Looking Ahead

  • WHW9

  • Exam 2, Mon., Apr. 10

    • more information in weekly announcement