17 Normal Distribution Concept Overview

Report 0 Downloads 101 Views
NORMAL DISTRIBUTION | CONCEPT OVERVIEW The TOPIC of NORMAL DISTRIBUTION can be referenced on page 39 of the NCEES Supplied Reference Handbook, Version 9.4 for Computer Based Testing.

CONCEPT INTRO: The NORMAL DISTRIBUTION refers to a family of continuous probability distributions defined by the normal equation. The parameters of the normal distribution are ๐œ‡๐œ‡ ๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž ๐œŽ๐œŽ.

The FORMULA for NORMAL PROBABILITY DENSITY FUNCTION can be referenced under the SUBJECT of ENGINEERING PROBABILITY AND STATISTICS on page 39 of the NCEES Supplied Reference Handbook, Version 9.4 for Computer Based Testing. The normal distribution is a unimodel distribution, such that ๐‘ฅ๐‘ฅ = ๐œ‡๐œ‡ with two points of inflection (each located at a distance of ๐œŽ๐œŽ to ether side of the mode). The average of ๐‘›๐‘› observations tends to become normally distributed as ๐‘›๐‘› increases. ๐‘“๐‘“ ๐‘ฅ๐‘ฅ =



1

๐œŽ๐œŽ 2๐œ‹๐œ‹

๐‘’๐‘’

/

0 2/3 5 1 4 โˆ’

Made with

โˆž โ‰ค ๐‘ฅ๐‘ฅ โ‰ค โˆž

by Prepineer | Prepineer.com

Where: โ€ข ๐œ‡๐œ‡ is the population mean

โ€ข ๐œŽ๐œŽ is the standard deviation of the population โ€ข ๐‘ฅ๐‘ฅ is a normal random variable โ€ข ๐œŽ๐œŽ 1 is the standard deviation

The normal equation is the PROBABILITY DENSITY FUNCTION for the normal distribution. The graph of the normal distribution depends on two factors โ€“ the mean and the standard deviation. The MEAN OF THE DISTRIBUTION determines the location of the center of the graph, and the standard deviation determines the height and width of the graph. When the standard deviation is large, the curve is short and wide; when the standard deviation is small, the curve is tall and narrow. All normal distribution looks like a symmetric, bell-shaped curve. There are a few key points to remember when dealing with continuous probability distributions: โ€ข The total area under the normal curve is equal to 1.

โ€ข The probability that a normal random variable ๐‘ฅ๐‘ฅ equals any particular value is 0.

โ€ข The probability that ๐‘ฅ๐‘ฅ is greater than a certain value ๐‘Ž๐‘Ž equals the area under the normal curve bounded by ๐‘Ž๐‘Ž and plus infinity.

โ€ข The probability that ๐‘ฅ๐‘ฅ is less than a certain value ๐‘Ž๐‘Ž equals the area under the normal curve bounded by ๐‘Ž๐‘Ž and minus infinity.



Made with

by Prepineer | Prepineer.com

Every normal curve (regardless of its mean or standard deviation) conforms to the 6895-99.7 rule, that is: โ€ข 68% of the area under the curve falls within 1 standard deviation of the mean โ€ข 95% of the area under the curve falls within 2 standard deviations of the mean โ€ข 99.7% of the area under the curve falls within 3 standard deviations of the mean. To find the probability associated with a normal random variable of a standard normal distribution, we can use the Z-SCORE and the normal distribution table. A z-score follows a standardized normal distribution function. The FORMULA FOR CALCULATING A Z-SCORE can be referenced under the topic of NORMAL DISTRIBUTION (GAUSSIAN DISTRIBUTION) on page 39 of the NCEES Supplied Reference Handbook, Version 9.4 for Computer Based Testing. Every normal random variable ๐‘ฅ๐‘ฅ can be transformed into a z-score using the equation below:

๐‘๐‘ = Where:

๐‘ฅ๐‘ฅ โˆ’ ๐œ‡๐œ‡ ๐œŽ๐œŽ

โ€ข ๐‘ฅ๐‘ฅ is the normal random variable โ€ข ๐œ‡๐œ‡ is the mean

โ€ข ๐œŽ๐œŽ is the standard deviation of ๐‘ฅ๐‘ฅ



Made with

by Prepineer | Prepineer.com

The NORMAL DISTRIBUTION TABLE can be referenced under the topic of ENGINEERING PROBABILITY AND STATISTICS on page 9.4 of the NCEES Supplied Reference Handbook, Version 9.4 for Computer Based Testing. The normal distribution function can not be integrated so we use Z-TABLES or the NORMAL DISTRIBUTION TABLE, which are for the standard normal distribution with ๐œ‡๐œ‡ = 0 ๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž ๐œŽ๐œŽ = 1.

The normal distribution shows a cumulative probability associated with a particular zscore. Table rows show the whole number and tenths place of the z-score. Table columns show the hundredths place. The cumulative probability appears in the cell of the table. The NOTATIONS ON THE UNIT NORMAL DISTRIBUTION TABLE can be referenced under the topic of NORMAL DISTRIBUTION (GAUSSIAN DISTRIBUTION) on page 39 of the NCEES Supplied Reference Handbook, Version 9.4 for Computer Based Testing. A unit normal distribution table is included at the end of the reference handbook section on probability and statistics. In the table, the following notations are utilized: โ€ข ๐น๐น ๐‘ฅ๐‘ฅ = ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž ๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘ ๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“ โˆ’ โˆž ๐‘ก๐‘ก๐‘ก๐‘ก ๐‘ฅ๐‘ฅ โ€ข ๐‘…๐‘… ๐‘ฅ๐‘ฅ = ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž ๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘ ๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘š๐‘š ๐‘ฅ๐‘ฅ ๐‘ก๐‘ก๐‘ก๐‘ก โˆž

โ€ข ๐‘Š๐‘Š ๐‘ฅ๐‘ฅ = ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž ๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘ ๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“ โˆ’ ๐‘ฅ๐‘ฅ ๐‘ก๐‘ก๐‘ก๐‘ก ๐‘ฅ๐‘ฅ

โ€ข ๐น๐น โˆ’๐‘ฅ๐‘ฅ = 1 โˆ’ ๐น๐น(๐‘ฅ๐‘ฅ)



Made with

by Prepineer | Prepineer.com

Often times we may not be interested in the probability that a normal random variable falls between minus infinity and a given value. We may want to know the probability that it lies between a given value and plus infinity, or lies between two given values. These probabilities are easy to computer from a normal distribution table. Consider the following: โ€ข The probability that a standard normal random variable (๐‘ง๐‘ง) is greater than a given value (๐‘Ž๐‘Ž), ๐‘ƒ๐‘ƒ(๐‘ง๐‘ง < ๐‘Ž๐‘Ž), is determined by 1 โˆ’ ๐‘ƒ๐‘ƒ(๐‘ง๐‘ง < ๐‘Ž๐‘Ž).

โ€ข The probability that a standard normal random variable lies between two values, ๐‘ƒ๐‘ƒ(๐‘Ž๐‘Ž < ๐‘ง๐‘ง < ๐‘๐‘) is ๐‘ƒ๐‘ƒ(๐‘ง๐‘ง < ๐‘๐‘) โˆ’ ๐‘ƒ๐‘ƒ(๐‘ง๐‘ง < ๐‘Ž๐‘Ž).

Often, events in the real world follow a normal distribution. This allows us to use the normal distribution as a model for assessing probabilities associated with these events.



Made with

by Prepineer | Prepineer.com

CONCEPT EXAMPLE: A company pays its employees an average wage of $3.25/hour with a standard deviation of 60 cents. If the wages are approximately normally distributed, the proportion of the workers getting wages between $2.75 and $3.69 an hour is most close to: A. 50%

B. 55% C. 60%

D. 65%



Made with

by Prepineer | Prepineer.com

SOLUTION: The FORMULA FOR CALCULATING A Z-SCORE can be referenced under the topic of NORMAL DISTRIBUTION (GAUSSIAN DISTRIBUTION) on page 39 of the NCEES Supplied Reference Handbook, Version 9.4 for Computer Based Testing. Every normal random variable ๐‘ฅ๐‘ฅ can be transformed into a z-score using the equation below:

๐‘๐‘ = Where:

๐‘ฅ๐‘ฅ โˆ’ ๐œ‡๐œ‡ ๐œŽ๐œŽ

โ€ข ๐‘ฅ๐‘ฅ is the normal random variable โ€ข ๐œ‡๐œ‡ is the mean

โ€ข ๐œŽ๐œŽ is the standard deviation of ๐‘ฅ๐‘ฅ

To analyze this problem, we will calculate the z-score and then use the normal distributions tables, to find probabilities associated with the z-scores. We are given the following values to solve for z-scores of each wage range: โ€ข x0 = 2.75

โ€ข ๐‘ฅ๐‘ฅ1 = 3.69 โ€ข ๐œ‡๐œ‡ = 3.25

โ€ข ๐œŽ๐œŽ = 0.60



Made with

by Prepineer | Prepineer.com

For the first group of with wages between $2.75-$3.25 per hour, we will calculate the zscore, ๐‘๐‘0 , of the probability a wage falls in that range as: ๐‘ง๐‘ง0 =

(2.75 โˆ’ 3.25) = โˆ’0.833 0.60

For the second group of with wages between $3.25-$3.69 per hour, we will calculate the z-score, ๐‘๐‘1 , of the probability a wage falls in that range as: ๐‘ง๐‘ง1 =

(3.69 โˆ’ 3.25) = 0.733 0.60

Which means that:

๐‘๐‘ 2.75 < ๐‘ฅ๐‘ฅ < 3.69 = ๐‘ƒ๐‘ƒ(โˆ’0.833 < ๐‘ง๐‘ง < 0.733)

We then look at the normal distribution tables to find the corresponding probability values for each z-score. A unit normal distribution table is included at the end of the reference handbook section on probability and statistics. In the table, the following notations are utilized: โ€ข ๐น๐น ๐‘ฅ๐‘ฅ = ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž ๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘ ๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“ โˆ’ โˆž ๐‘ก๐‘ก๐‘ก๐‘ก ๐‘ฅ๐‘ฅ

โ€ข ๐‘…๐‘… ๐‘ฅ๐‘ฅ = ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž ๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘ ๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“ ๐‘ฅ๐‘ฅ ๐‘ก๐‘ก๐‘ก๐‘ก โˆž

โ€ข ๐‘Š๐‘Š ๐‘ฅ๐‘ฅ = ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž๐‘Ž ๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข๐‘ข ๐‘ก๐‘กโ„Ž๐‘’๐‘’ ๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘๐‘ ๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“๐‘“ โˆ’ ๐‘ฅ๐‘ฅ ๐‘ก๐‘ก๐‘ก๐‘ก ๐‘ฅ๐‘ฅ

โ€ข ๐น๐น โˆ’๐‘ฅ๐‘ฅ = 1 โˆ’ ๐น๐น(๐‘ฅ๐‘ฅ)



Made with

by Prepineer | Prepineer.com

For the first group of with wages between $2.75-$3.25 per hour, we use the z-score, ๐‘๐‘0 = 0.833, to find the probability the wages are LESS THAN $3.25 an hour. We use the

๐น๐น(๐‘ฅ๐‘ฅ) column to indicate ๐‘ƒ๐‘ƒ ๐‘ฅ๐‘ฅ < 3.25 and ๐‘ฅ๐‘ฅ = 0.8 row, as this is the closet value we can

use without interpolating. We use the F(x) column as we are looking to calculate the area under the curve from โˆ’โˆž ๐‘ก๐‘ก๐‘ก๐‘ก ๐‘ฅ๐‘ฅ, ๐‘ค๐‘คโ„Ž๐‘’๐‘’๐‘’๐‘’๐‘’๐‘’ ๐‘ฅ๐‘ฅ = 0.8.

The NORMAL DISTRIBUTION TABLE can be referenced under the topic of ENGINEERING PROBABILITY AND STATISTICS on page 9.4 of the NCEES Supplied Reference Handbook, Version 9.4 for Computer Based Testing.

๐‘ƒ๐‘ƒ ๐‘ฅ๐‘ฅ < 3.25 = โˆ’0.833 = 0.7881



Made with

by Prepineer | Prepineer.com

For the second group of with wages between $3.25-$3.69 per hour, we will calculate the z-score, ๐‘๐‘1 = 0.733, to find the probability the wages are GREATER THAN $3.25 an

hour. We use the ๐น๐น(๐‘ฅ๐‘ฅ) column to indicate ๐‘ƒ๐‘ƒ ๐‘ฅ๐‘ฅ > 3.25 =and ๐‘ฅ๐‘ฅ = 0.7 row, as this is the

closet value we can use without interpolating. We use the ๐‘…๐‘…(๐‘ฅ๐‘ฅ) column as we are looking to calculate the area under the curve from ๐‘ฅ๐‘ฅ ๐‘ก๐‘ก๐‘ก๐‘ก โˆž, ๐‘ค๐‘คโ„Ž๐‘’๐‘’๐‘’๐‘’๐‘’๐‘’ ๐‘ฅ๐‘ฅ = 0.7.

๐‘ƒ๐‘ƒ ๐‘ฅ๐‘ฅ > 3.25 = ๐‘ƒ๐‘ƒ 0.733 = 0.2420

We then sum the probabilities of each range to calculate the percentage the proportion of the workers getting wages between $2.75 and $3.69 an hour: ๐‘ƒ๐‘ƒ ๐‘ฅ๐‘ฅ < 3.25 โˆ’ ๐‘ƒ๐‘ƒ ๐‘ฅ๐‘ฅ > 3.25 = 0.7881 โˆ’ 0.2420 = 0.5461 = 54.61%



Made with

by Prepineer | Prepineer.com

Using our calculator hack, we get an answer of 0.566 or 56.6%, which has a percent

difference of 1.05% from the method of solving for the problem by hand. On the FE

Exam, we should look for the closest answer choice, and realize that NCEES will not put any choices that have a percent difference of less than 10%.

Therefore, the correct answer is ๐๐. ๐Ÿ“๐Ÿ“๐Ÿ“๐Ÿ“. %



Made with

by Prepineer | Prepineer.com