Statistical Concepts

Z-Score: What It Is, Formula, and How to Interpret It

6 min read

Learn what a z-score is, how to calculate it with the standard formula, and how to interpret standardized values in research and survey analysis.

What Is a Z-Score?

A z-score (also called a standard score) measures how many standard deviations a data point sits above or below the mean of a dataset. A z-score of 0 means the value is exactly at the mean, a positive z-score means it's above the mean, and a negative z-score means it's below. For example, a z-score of +2.0 tells you a value is two standard deviations above average. Z-scores standardize data onto a common scale, making it possible to compare values from different distributions, like comparing satisfaction scores across product lines that use different rating scales.

Why Z-Scores Matter

Z-scores let you put any data point in context by answering the question "how unusual is this value?" Without standardization, a raw score of 78 means nothing until you know the distribution it came from. In market research, z-scores help identify outlier respondents, compare metrics across different scales, and determine whether observed differences are statistically meaningful.

How Z-Scores Work

The Formula

The z-score for an individual data point is calculated as:

z = (X - μ) / σ

Where:

  • X = the individual data point
  • μ (mu) = the population mean
  • σ (sigma) = the population standard deviation

When working with a sample rather than a full population, substitute the sample mean (x̄) and sample standard deviation (s):

z = (X - x̄) / s

Worked Example

Suppose you ran a customer satisfaction survey and the average score across all respondents is 72 with a standard deviation of 8. One respondent scored 88. Their z-score would be:

z = (88 - 72) / 8 = 16 / 8 = 2.0

This respondent scored exactly 2 standard deviations above the mean. Consulting a z-table, a z-score of 2.0 corresponds to the 97.7th percentile, meaning roughly 97.7% of respondents scored lower.

Another respondent scored 64:

z = (64 - 72) / 8 = -8 / 8 = -1.0

This person is 1 standard deviation below the mean, placing them around the 15.9th percentile.

Interpreting Z-Scores

For normally distributed data, z-scores follow a predictable pattern known as the empirical rule (68-95-99.7 rule):

  • About 68% of values fall between z = -1 and z = +1
  • About 95% of values fall between z = -2 and z = +2
  • About 99.7% of values fall between z = -3 and z = +3

Values beyond z = ±3 are rare in a normal distribution (less than 0.3% of data), which is why z-scores of ±3 or beyond are commonly used as outlier thresholds.

Z-Scores in Hypothesis Testing

Z-scores underpin the z-test, which determines whether a sample mean differs significantly from a known population mean. The test statistic is:

z = (x̄ - μ) / (σ / √n)

Here, σ / √n is the standard error of the mean. If the resulting z-value exceeds the critical value for your chosen significance level (e.g., ±1.96 for α = 0.05 in a two-tailed test), you reject the null hypothesis. This is the same logic behind confidence intervals: a 95% confidence interval corresponds to z = ±1.96.

Standardizing Across Scales

One of the most practical uses of z-scores in research is making different scales comparable. If you measured brand perception on a 1-10 scale for one product and a 1-100 scale for another, the raw numbers aren't directly comparable. Converting both sets of scores to z-scores puts them on the same standardized scale, letting you make apples-to-apples comparisons.

When to Use Z-Scores

  • Identifying outlier respondents in survey data who gave unusually high or low ratings across all questions
  • Comparing scores across different scales or instruments that use different units of measurement
  • Hypothesis testing when the population standard deviation is known and you need to evaluate statistical significance
  • Normalizing data before running multivariate analyses like cluster analysis or factor analysis
  • Setting thresholds for automated quality checks in survey data (e.g., flagging respondents with average z-scores beyond ±2)

Common Mistakes to Avoid

  • Assuming normality without checking: z-score interpretation relies on the data being approximately normally distributed; with skewed data, the percentile translations don't hold
  • Using z-scores with tiny samples: with fewer than 30 observations, the t-distribution is more appropriate than the z-distribution because the standard deviation estimate is less stable
  • Confusing z-scores with percentiles directly: a z-score of 1.0 doesn't mean "top 1%"; it corresponds to approximately the 84th percentile

How Quali-Fi Supports Z-Score Analysis

Quali-Fi automatically calculates standardized scores when you enable data quality checks, flagging respondents whose patterns fall outside expected ranges. For advanced analysis, the Research plan includes cross-tabulation with significance testing that uses z-tests to compare subgroup proportions and means.

Explore Quali-Fi's analysis tools

Frequently Asked Questions

What's the difference between a z-score and a t-score?

Both measure how far a value is from the mean in standard deviation units, but they use different reference distributions. Z-scores rely on the normal distribution and require a known population standard deviation. T-scores use the t-distribution, which accounts for extra uncertainty when working with small samples and an estimated standard deviation. As sample size grows (typically past 30), the two converge.

Can z-scores be greater than 3 or less than -3?

Yes, but it's uncommon in normally distributed data, values beyond ±3 represent less than 0.3% of observations. In practice, z-scores of ±3 or more often signal outliers or indicate that the data isn't normally distributed. Some datasets with heavy tails (like income data) regularly produce z-scores beyond ±3.

How do z-scores relate to confidence intervals?

Confidence intervals use z-scores (or t-scores) as multipliers. A 95% confidence interval uses z = 1.96, meaning the interval extends 1.96 standard errors above and below the sample mean. A 99% confidence interval uses z = 2.576. The z-score defines how wide the interval needs to be to capture the true population parameter at your chosen confidence level.

Frequently Asked Questions

Related Guides

Put it into practice

Ready to apply this in your research?

Quali-Fi makes it easy to run surveys, conjoint studies, and more, all in one platform.