The Normal Distribution
Definition:
The Normal Distribution
where
$\mu =$ the mean
$\sigma=$ the standard deviation of the distribution.
Remark
Properties of the Normal Distribution
The Empirical Rule and Chebyshev's Theorem
- approximately $68\%$ of the data falls within one standard deviation of the mean
- $95\%$ within two standard deviations
- $99.7\%$ within three standard deviations
Standardization
Definition:
Standard Normal Variable
$X=$ the value of the random variable
$\mu =$ the mean
$\sigma=$ the standard deviation of the distribution.
Remark
- Relocates the mean, $\mu$, to 0 .
- Rescales the standard deviation, $\sigma$, to 1 .
- Takes all values of $X$, and reconfigures them into values of $Z$. Positive z-scores indicate values above the mean, while negative z-scores represent values below the mean.
Example
For each of the following questions, use the $Z-$table to find the probability of the given event.
Example
For each of the following questions, use the $Z-$table to find the probability of the given event.
Example
For each of the following questions, use the $Z-$table to find the probability of the given event.
Example
For each of the following questions, find the value of $Z$ which satisfies the following inequalities.
Example
For each of the following questions, find the value of $Z$ which satisfies the following inequalities.
Example
For each of the following questions, find the value of $Z$ which satisfies the following inequalities.
Example
Let $X$ be a continuous random variable that is normally distributed with a mean of $\mu=5$ and a standard deviation of $\sigma=4$. Calculate the following:
Example
A software company tracks the response times of its servers to user requests. The response time (in milliseconds) is a random variable, $X$, that is normally distributed with a mean of 120 ms and a standard deviation of 15 ms . The company wants to ensure a fast user experience by analyzing the distribution of response times.
Example
In a physics laboratory, researchers are studying the speeds of particles traveling through a medium. The speed of the particles, $X$, is normally distributed with a mean of $2,500 m/s$ and a standard deviation of $200 m/s$.
Example
In a biology lab, scientists are studying the lengths of a specific type of leaf on a plant. The lengths, $X$, are normally distributed with a mean of $15\,cm$ and a standard deviation of $2.5\,cm$ .
Example
An Al company measures the time it takes for its machine learning model to process an image. The processing time, $X$, is normally distributed with a mean of 0.8 seconds and a standard deviation of 0.1 seconds.
The Normal Approximation to the Binomial Distribution
However, under certain conditions, the binomial distribution closely resembles a normal distribution. This similarity allows us to leverage the properties of the normal distribution to estimate binomial probabilities more efficiently.
Conditions for the Normal Approximation
1. $\quad n \cdot p \geq 5$
2. $\quad n \cdot (1-p) \geq 5$
These conditions ensure that the binomial distribution is approximately symmetric and bell-shaped, which is a characteristic of the normal distribution.
Correction Factors and Calculating Probabilities with the Normal Approximation
The table below outlines the appropriate correction factors based on the type of binomial probability being calculated. $$\begin{array}{|c|c|} \hline \text { Condition } & \text { Correction Factor } \\ \hline P(X=a) & P(a-0.5 < X < a+0.5) \\ P(X > a) & P(X > a+0.5) \\ P(X \geq a) & P(X > a-0.5) \\ P(X < a) & P(X < a-0.5) \\ P(X \leq a) & P(X < a+0.5) \\ \hline \end{array}$$
Example
Suppose a biologist is studying a population of beetles, where the probability of a beetle having a particular genetic trait is $p=0.3$.
Example
A physicist is testing a batch of 1,000 light-emitting diodes (LEDs). Each LED has a probability $p=$ 0.98 of functioning correctly.
Example
A computer scientist is testing a large batch of 2,000 processors for reliability. Each processor has a probability $p=0.995$ of passing a reliability test. The scientist wants to determine:
Example
An Al company is testing a new speech recognition algorithm using a dataset of audio clips. Each clip is classified as either ``recognized correctly`` or ``not recognized.`` Based on prior testing, the algorithm has a $95 \%$ chance ( $p=0.95$ ) of correctly recognizing a clip.