Descriptive Statistics

Mean

The population mean μ or sample mean x¯ of a dataset x1,x2,...,xN is given by:

μ=x¯=xiN

Note: represents i=1N.

Median

The median of a dataset is the middle value of the sorted dataset.

Mode

The mode of a dataset is the value that occurs most frequently.

Variance & Standard Deviation

The variance and standard deviation of a dataset are measures of the degree of variation of the data from the mean value.

Population Variance

The population variance σ2 of a dataset x1,x2,...,xN is given by:

σ2=(xiμ)2N=xi2Nμ2

Sample Variance

The sample variance s2 of a dataset x1,x2,...,xN is given by:

s2=(xix¯)2N1=xi2(xi)2NN1

Population Standard Deviation

The population standard deviation σ of a dataset x1,x2,...,xN is given by:

σ=(xiμ)2N=xi2Nμ2

Sample Standard Deviation

The sample standard deviation s of a dataset x1,x2,...,xN is given by:

s=(xix¯)2N1=xi2(xi)2NN1

Skewness

The skewness of a dataset is a measure of its asymmetry.

Fisher-Pearson Population Skewness

The Fisher-Pearson population skewness μ~3 of a dataset x1,x2,...,xN is given by:

μ~3=(xiμ)3Nσ3

Fisher-Pearson Sample Skewness

The Fisher-Pearson sample skewness g1 of a dataset x1,x2,...,xN is given by:

g1=(xix¯)3Ns3

Adjusted Fisher-Pearson Sample Skewness

The adjusted Fisher-Pearson sample skewness G1 of a dataset x1,x2,...,xN is given by:

G1=N(N1)N2×(xix¯)3Ns3

Covariance & Correlation

The covariance and correlation coefficient of two datasets are measures of linear relationship between the datasets.

Population Covariance

The population covariance σxy of datasets x1,x2,...,xN and y1,y2,...,yN is given by:

σxy(xiμx)(yiμy)N

Sample Covariance

The sample covariance sxy of datasets x1,x2,...,xN and y1,y2,...,yN is given by:

sxy(xix¯)(yiy¯)N1

Pearson Correlation Coefficient

The Pearson correlation coefficient rxy of datasets x1,x2,...,xN and y1,y2,...,yN is given by:

rxy=σxyσxσx=sxysxsx