Random Variables
We talked before about probability using simple sample spaces, but in reality, we will map situations to probabilistic models that cater to all possible results. The variables to which we are going to construct probabilistic models are called random variables.
A random variable is a real-valued variable whose value is determined by an underlying random experiment. We usually show random variables by capital letters such as
Discrete Random Variables
Example: Imagine that we extracted 2 balls without repositioning from an urn that initially contained 2 blue balls and 3 violet balls. Let’s define
Result | Probability | X |
---|---|---|
BB | 1/10 | 0 |
BV | 3/10 | 1 |
VB | 3/10 | 1 |
VV | 3/10 | 2 |
Each result of the experiment is associated with a value of
In the table below, we have the probability distribution of
x | p(x) | |
---|---|---|
0 | 1/10 | |
1 | 6/10 | |
2 | 3/10 |
Special distributions
Some random variables adapt very well to a series of practical problems, and they have been given unique names.
Discrete Uniform distribution
Each value has the same probability of happening. X is uniformly distributed if and only if
Bernoulli
It can only take two possible values, usually 0 and 1. This random variable models random experiments that have two possible outcomes, sometimes referred to as “success” and “failure”.
A random variable
Example: A die is tossed, and we are interested at die showing 5, so it either it shows a 5 or not.
Binomial distribution
Assume that we repeat a Bernoulli experiment n times, also, assume that the repetitions are independent, i.e., the result of an experiment won’t impact another one. A random variable
Example: A coin is thrown 3 times. What is the probability of 2 heads?
Geometric distribution
Repeating independent Bernoulli trials until observing the first success. A random variable
Poisson distribution
Used in scenarios where we are counting the occurrences of certain events in an interval of time or space. A random variable
Example: A phone number receives, on average, 5 calls per minute. What is the probability of not receiving a call in one minute?
Continuous Random Variables
A continuous random variable is one which takes an infinite number of possible values; It is defined over an interval of values, and is represented by the area under a curve. The probability of observing any single value is equal to 0, since the number of values which may be assumed by the random variable is infinite.
Special distributions
We also have special distributions for continuous random variables.
Uniform distribution
A random variable
Exponential distribution
A random variable
Normal (Gaussian) distribution
The normal distribution is by far the most important probability distribution. One of the main reasons for that is the Central Limit Theorem (CLT) that we will discuss in another moment.
A random variable
Below we can see the representation of a normal distribution with mean
When
If