Understanding Statistical Functions in Google Sheets
September 11th, 2023
Data analysis is an invaluable component of many professions. Google Sheets equips users with a variety of statistical functions ranging from basic variance calculations to advanced distributions and tests. This guide not only introduces these functions but also delves into when and why you might use them.
Basic Statistical Measures
VARA
and VAR.P
Calculate the variance of a dataset. While VARA
considers both numbers and text, VAR.P
focuses solely on numbers for the entire population.
When to use: To assess the spread or variability in a dataset, especially when determining risk or volatility in financial scenarios.
=VARA(B2:B10)
=VAR.P(C2:C10)
STDEV.P
and STDEVPA
Determine the amount of variation or dispersion in your data set.
When to use: When assessing the consistency and reliability of data sets, such as quality control in manufacturing.
=STDEV.P(D2:D10)
=STDEVPA(E2:E10)
Distributions
GAMMAINV
and GAMMA.INV
Inverse functions of the gamma distribution.
When to use: When modeling events that happen at particular intervals, like the number of calls received at a call center in an hour.
=GAMMAINV(0.5,2,3)
=GAMMA.INV(0.7,5,1)
NORM.INV
and NORMINV
Determine the inverse of the normal cumulative distribution for specific mean and standard deviation.
When to use: In stock market analyses or when predicting future values based on historical data.
=NORM.INV(0.9,0,1)
=NORMINV(0.95,3,2)
NORM.S.DIST
and WEIBULL.DIST
When to use: NORM.S.DIST
for understanding probabilities under the standard bell curve. WEIBULL.DIST
in reliability engineering to measure the likelihood of failure over time.
=NORM.S.DIST(1.5,TRUE)
=WEIBULL.DIST(2.5,6,7,TRUE)
T.INV
and FDIST
When to use: In hypothesis testing. T.INV
for smaller sample sizes and FDIST
to compare variances of two datasets.
=T.INV(0.1,7)
=FDIST(0.8,6,7)
HYPGEOMDIST
and NEGBINOM.DIST
When to use: In market research to understand how likely a specific outcome is, given a particular sample size.
=HYPGEOMDIST(4,10,5,20)
=NEGBINOM.DIST(4,6,0.7,TRUE)
Advanced Functions and Tests
KURT
Measures data "tailedness".
When to use: To identify the presence of outliers or extreme values in a dataset.
=KURT(A2:A20)
ZTEST
and T.TEST
When to use: To determine if differences between groups are statistically significant, like in A/B testing.
=ZTEST(B2:B30, 50)
=T.TEST(C2:C30, D2:D30,2,1)
ERFC.PRECISE
and GAMMALN.PRECISE
When to use: In advanced statistical analyses where high precision is necessary, such as in scientific research.
=ERFC.PRECISE(0.7)
=GAMMALN.PRECISE(5)
Conclusion
With a solid understanding of Google Sheets' statistical functions and insights into their applications, you can elevate your data analysis skills. As you explore these functions and their use-cases, you'll uncover the potential of Google Sheets as a powerful statistical tool.