MGCR 271 Crib Sheet By Kareem Halabi Stem plot: Stem is first few digits, 2nd column represents the last digit of each data point (can be multiple of the same number) 1
Mean (population mean is ΞΌ): π₯ = β π₯π π
Sample standard deviation (quantifiable spread): π π₯ , π π₯, π β² or π₯ππβ1 = β
β(π₯βπ₯)2
β(π₯βπ)2
π₯βπ₯ π π₯
(if |π§| > 3 then the number
is an outlier). All the z scores of a data set will have π₯ = 0, π π₯ = 1 Coefficient of Variation: πΆπ =
π π₯ π₯Μ
Intuitive definition of Percentiles: If n data are arranged in numerical order, a number x is called a pth percentile if At least p% of the data β€ x At least (100-p)% of the data β₯ x
Put the data into numerical order Calculate π% (π + 1) = π (an integer) + π (decimal) The pth percentile is: ππ + π (ππ+π β ππ )
First Quartile: Q1 = 25th percentile Median: M = 50th percentile Third Quartile: Q3 = 75th percentile Interquartile Range: IQR = Q3 - Q1 Outliers by boxplot criterion: High outlier: π₯ > π3 + 1.5 Γ πΌππ Low outlier: π₯ < π1 β 1.5 Γ πΌππ
Q1
M
Q3
Highest non-outlier
π
Negative (left) skew: π₯π
π(πΈ | πΉ) =
1.
π=
Conditional Probability: Probability of E happening if F also happens
Example Turkeyβs boxplot:
Lowest non-outlier
2
π
Z-score: π§ =
1. 2.
Random variable: assigns a numerical value to every possible outcome of an experiment Discrete random variable: When there is a gap between successive possible values (Ex: can have 72 or 73 people but not 72.3) Residual (vertical distance) for (ππ , ππ ): ππ = Continuous random variable: can assume all π¦π β π β ππ₯π vales in some interval Ordinary Least Squares regression: Goal is Probability Distribution Function (PDF): the to minimize β ππ 2 set of all possible values of a discrete random π¦ = π + ππ₯ variable together with their probabilities π=
Median Third Quartile Highest non-outlier (by boxplot criterion)
πβ1
Population standard deviation (always use this when dealing with decimal percentages): ππ₯ = β
3. 4. 5.
Lowest non-outlier (by boxplot criterion) First Quartile
Empirical Rule: If a data set is unimodal and not very skewed, then 1. ~68% of data are within 1π π₯ of π₯Μ 2. ~95% of data are within 2π π₯ of π₯Μ 3. ~99.7% of data are within 3π π₯ of π₯Μ
Expected Value of x: πΈ(π₯) = π(π₯) = ππ₯ = π₯Μ Statistical Independence: The probability of A happening is independent of whether or not B happens (the second flip of a coin is independent from the first flip) 1. 2. 3.
There are only two outcomes to an experiment: Success and failure Let p be the probability of success, and it is the same every time
If the experiment is performed n times, the probability of x successes and n-x failures is: π ( ) ππ (π β π)πβπ on calc: (nCx)(p^x)(1-p)^(n-x) π For Binomial problems: πΈ(π₯) = ππ
π(π₯) = βππ(1 β π) End of Midterm Material