# Science:MATH105 Probability/Lesson 2 CRV/2.07 The Beta PDF

Let us consider some common continuous random variables that often arise in practice. We should stress that this is indeed a very small sample of common continuous distributions.

## The Beta Distribution

Suppose the proportion of restaurants that make a profit in their first year of operation is modeled by a beta random variable X, whose probability density function at a value p is:

${\displaystyle f(p)={\begin{cases}12p(1-p)^{2}&{\text{if }}0\leq p\leq 1,\\0&{\text{elsewhere}}.\end{cases}}}$

What is the probability that more than half of the restaurants will make a profit during their first year of operation? To answer this question, we calculate the probability as an area under the PDF curve as follows:

${\displaystyle {\begin{aligned}\mathrm {Pr} (0.5\leq X\leq 1)&=\int _{0.5}^{1}f(p)\,dp\\&=\int _{0.5}^{1}12p(1-p)^{2}\,dp\\&=\int _{0.5}^{1}\left(12p-24p^{2}+12p^{3}\right)dp\\&=\left(6p^{2}-8p^{3}+3p^{4}\right){\Big |}_{0.5}^{1}\\&=(6-8+3)-(1.5-1+0.1875)\\&=0.3125\end{aligned}}}$

Therefore, Pr(0.5 ≤ X ≤ 1) = 0.3125.
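As a sanity check on the integral above, the same area can be approximated numerically. The following is a minimal sketch using composite Simpson's rule; the function names `beta_pdf` and `simpson` are our own choices for illustration and are not part of the course material. Since the integrand is a cubic polynomial, Simpson's rule should reproduce the exact answer up to rounding error.

```python
def beta_pdf(p):
    """PDF from the restaurant example: 12 p (1 - p)^2 on [0, 1], 0 elsewhere."""
    if 0.0 <= p <= 1.0:
        return 12.0 * p * (1.0 - p) ** 2
    return 0.0


def simpson(f, lower, upper, n=100):
    """Approximate the integral of f on [lower, upper] with composite
    Simpson's rule, using n subintervals (n must be even)."""
    if n % 2 != 0:
        raise ValueError("n must be even")
    h = (upper - lower) / n
    total = f(lower) + f(upper)
    for i in range(1, n):
        # Interior points alternate between weight 4 (odd i) and 2 (even i).
        weight = 4 if i % 2 == 1 else 2
        total += weight * f(lower + i * h)
    return total * h / 3.0


# Probability that more than half of the restaurants make a profit.
prob_more_than_half = simpson(beta_pdf, 0.5, 1.0)
print(round(prob_more_than_half, 4))  # 0.3125
```

Integrating the same function over [0, 1] returns a value very close to 1, as it must for any PDF.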

The example above is a particular case of a beta random variable. In general, a beta random variable has the generic PDF:

${\displaystyle f(x)={\begin{cases}kx^{a-1}(1-x)^{b-1}&{\text{if }}0\leq x\leq 1,\\0&{\text{elsewhere}}\end{cases}}}$

where the constants a and b are greater than zero, and the constant k is chosen so that the density f integrates to 1.
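The text does not give a formula for k. A standard closed form, stated here as a fact from outside this section, is k = Γ(a+b)/(Γ(a)Γ(b)), where Γ is the gamma function. The sketch below evaluates it with Python's `math.gamma`; the function name `beta_normalizing_constant` is hypothetical.

```python
import math


def beta_normalizing_constant(a, b):
    """Return k such that k * x^(a-1) * (1-x)^(b-1) integrates to 1 on [0, 1].

    Uses the closed form k = Gamma(a + b) / (Gamma(a) * Gamma(b)),
    which is an assumption brought in from outside the lesson text.
    """
    if a <= 0 or b <= 0:
        raise ValueError("a and b must both be greater than zero")
    return math.gamma(a + b) / (math.gamma(a) * math.gamma(b))


# With a = 2 and b = 3 the density is k * x * (1 - x)^2.
k = beta_normalizing_constant(2, 3)
print(k)  # approximately 12
```

For a = 2 and b = 3 this gives k = 12, which matches the coefficient in the restaurant example.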

We see that our previous example was a beta random variable given by the above density with a = 2 and b = 3. Let us find the associated cumulative distribution function F(p) for this random variable. We compute:

${\displaystyle {\begin{aligned}F(p)&=\int _{-\infty }^{p}f(t)\,dt\\&=\int _{0}^{p}12t(1-t)^{2}\,dt\\&=12\int _{0}^{p}(t-2t^{2}+t^{3})\,dt\\&=12{\Big (}{\frac {1}{2}}t^{2}-{\frac {2}{3}}t^{3}+{\frac {1}{4}}t^{4}{\Big )}{\Big |}_{0}^{p}\\&=p^{2}(6-8p+3p^{2}),\end{aligned}}}$

valid for 0 ≤ p ≤ 1. Outside this interval, F(p) = 0 for p < 0 and F(p) = 1 for p > 1.
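The CDF gives a second route to the restaurant probability, since Pr(X ≥ 0.5) = 1 − F(0.5). The sketch below encodes the formula for F(p) derived above; the name `beta_cdf` is our own, and extending F to 0 below the interval and 1 above it is the usual convention for a CDF rather than something computed in the derivation.

```python
def beta_cdf(p):
    """CDF of the restaurant example: F(p) = p^2 (6 - 8p + 3p^2) on [0, 1]."""
    if p <= 0.0:
        return 0.0  # no probability mass below 0
    if p >= 1.0:
        return 1.0  # all probability mass lies at or below 1
    return p ** 2 * (6.0 - 8.0 * p + 3.0 * p ** 2)


# Probability that more than half of the restaurants make a profit.
prob_more_than_half = 1.0 - beta_cdf(0.5)
print(prob_more_than_half)  # 0.3125
```

This agrees with the value obtained by integrating the PDF directly from 0.5 to 1.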

## The Exponential Distribution

The lifespan of a lightbulb can be modeled by a continuous random variable, since lifespan (that is, time) is a continuous quantity. A reasonable distribution for this random variable is the exponential distribution.

A random variable Y has an exponential distribution with parameter β > 0 if its PDF is given by ${\displaystyle f(y)={\begin{cases}{\frac {1}{\beta }}e^{-y/\beta }&{\text{if }}0\leq y<\infty \\0&{\text{elsewhere}}\end{cases}}}$

Suppose that the lifespan (in months) of lightbulbs manufactured at a certain facility can be modeled by an exponential random variable Y with parameter β = 4. What is the probability that a particular lightbulb lasts at least a year? Again, we can calculate this probability by evaluating an integral. Since there are 12 months in one year, we calculate

${\displaystyle {\begin{aligned}\mathrm {Pr} (Y\geq 12)&=\int _{12}^{\infty }f(y)\,dy\\&=\int _{12}^{\infty }{\frac {1}{4}}e^{-y/4}\,dy\\&=-e^{-y/4}{\Big |}_{12}^{\infty }\\&=0-(-e^{-3})\\&\approx 0.04979\end{aligned}}}$

Thus there is only about a 5% chance that a lightbulb lasts a year or more. Equivalently, with probability of about 0.95 we would need to replace a lightbulb produced at this facility within one year.
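The calculation above generalizes: for an exponential random variable with parameter β, the same integration gives Pr(Y ≥ y) = e^(−y/β) for any y ≥ 0. A minimal sketch, with the hypothetical function name `exponential_survival`:

```python
import math


def exponential_survival(y, beta):
    """Return Pr(Y >= y) for an exponential random variable with parameter beta.

    Follows from integrating (1/beta) * exp(-t/beta) from y to infinity.
    """
    if beta <= 0:
        raise ValueError("beta must be greater than zero")
    if y <= 0:
        return 1.0  # the lifespan is always at least 0
    return math.exp(-y / beta)


# Lightbulb example: beta = 4 months, survival past 12 months.
prob_lasts_a_year = exponential_survival(12, 4)
print(round(prob_lasts_a_year, 5))      # 0.04979
print(round(1 - prob_lasts_a_year, 5))  # 0.95021
```

The second printed value is the probability that the bulb fails within its first year.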

## The Continuous Uniform Distribution

Our third example of a common continuous random variable is one that we have already encountered. Consider the experiment of randomly choosing a real number from the interval [a,b]. Letting X denote this random outcome, we say that X has a continuous uniform distribution on [a,b] if the probability that we choose a value in some subinterval of [a,b] is given by the relative size of that subinterval in [a,b]. More explicitly, we have the following:

A random variable X has a continuous uniform distribution on [a,b] if its PDF is constant on [a,b]; i.e. its PDF is given by ${\displaystyle f(x)={\begin{cases}{\frac {1}{b-a}}&{\text{if }}a\leq x\leq b\\0&{\text{elsewhere}}\end{cases}}}$
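Because the PDF is constant, the probability of landing in a subinterval [c,d] of [a,b] is simply (d − c)/(b − a), the relative size of the subinterval. The sketch below illustrates this; the function name `uniform_interval_probability` and the choice to clip [c,d] to [a,b] are our own assumptions for the example.

```python
def uniform_interval_probability(a, b, c, d):
    """Return Pr(c <= X <= d) for X uniform on [a, b].

    The interval [c, d] is clipped to [a, b], since the PDF is zero elsewhere.
    """
    if not a < b:
        raise ValueError("need a < b")
    left = max(a, c)
    right = min(b, d)
    if right <= left:
        return 0.0  # no overlap with [a, b], or an interval of zero length
    return (right - left) / (b - a)


# X uniform on [0, 10]: probability of falling in [2, 5].
print(uniform_interval_probability(0.0, 10.0, 2.0, 5.0))  # 0.3
```

Note that the answer depends only on the length of the subinterval, not on where it sits inside [a,b].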

The continuous uniform distribution has a particularly simple representation, just as its discrete counterpart does. Nevertheless, this random variable has great practical and theoretical utility. We will explore this distribution in more detail in the exercises.

For the purposes of MATH 105, students are not expected to memorize the formulae for the probability density functions introduced in this section, but may need to use them to complete assigned work.