Continuous Random Variables and their Distributions

4.1.0 Continuous Random Variables and their Distributions

We have in fact already seen examples of continuous random variables before, e.g., Example 1.14. Let us look at the same example with just a little bit different wording.

Example

I choose a real number uniformly at random in the interval $[a,b]$, and call it $X$. By uniformly at random, we mean all intervals in $[a,b]$ that have the same length must have the same probability. Find the CDF of $X$.

Solution

As we mentioned, this is almost exactly the same problem as Example 1.14, with the difference being, in that problem, we considered the interval from $1$ to $2$. In that example, we saw that all individual points have probability $0$, i.e., $P(X=x)=0$ for all $x$. Also, the uniformity implies that the probability of an interval of length $l$ in $[a,b]$ must be proportional to its length: $$P(X \in [x_1,x_2]) \propto (x_2-x_1), \hspace{20pt} \textrm{where }a \leq x_1 \leq x_2 \leq b.$$ Since $ P(X \in [a,b])=1$, we conclude $$P(X \in [x_1,x_2]) =\frac{x_2-x_1}{b-a}, \hspace{20pt} \textrm{where }a \leq x_1 \leq x_2 \leq b.$$ Now, let us find the CDF. By definition $F_X(x)=P(X \leq x)$, thus we immediately have $$F_X(x) =0, \hspace{20pt} \textrm{for } x < a,$$ $$F_X(x) =1, \hspace{20pt} \textrm{for } x \geq b.$$ For $a \leq x \leq b$, we have

$F_X(x)$	$=P(X \leq x)$
	$=P(X \in [a,x])$
	$=\frac{x-a}{b-a}.$

Thus, to summarize \begin{equation} \hspace{70pt} F_X(x) = \left\{ \begin{array}{l l} 0 & \quad \textrm{for } x < a \\ \frac{x-a}{b-a} & \quad \textrm{for }a \leq x \leq b\\ 1 & \quad \textrm{for } x > b \end{array} \right. \hspace{70pt} (4.1) \end{equation} Note that here it does not matter if we use "$ < $" or "$ \leq $", as each individual point has probability zero, so for example $P(X < 2)=P(X \leq 2)$. Figure 4.1 shows the CDF of $X$. As we expect the CDF starts at zero and ends at $1$.

Fig.4.1 - CDF for a continuous random variable uniformly distributed over $[a,b]$.

One big difference that we notice here as opposed to discrete random variables is that the CDF is a continuous function, i.e., it does not have any jumps. Remember that jumps in the CDF correspond to points $x$ for which $P(X=x) > 0$. Thus, the fact that the CDF does not have jumps is consistent with the fact that $P(X=x)=0$ for all $x$. Indeed, we have the following definition for continuous random variables.

Definition
A random variable $X$ with CDF $F_X(x)$ is said to be continuous if $F_X(x)$ is a continuous function for all $x \in \mathbb{R}$.

We will also assume that the CDF of a continuous random variable is differentiable almost everywhere in $\mathbb{R}$.

← previous

The print version of the book is available on Amazon.

Practical uncertainty: Useful Ideas in Decision-Making, Risk, Randomness, & AI