Chi-Square Test Applied to Survey Data: Walkthrough

Q: What sample size do I need for chi-square?

The requirement is about expected cell frequencies, not total sample size. Each cell in your crosstab should have an expected frequency of at least 5. For a 3x3 table, this typically means 45-90+ total respondents depending on how evenly distributed your categories are. Uneven distributions require larger samples.

Q: Can I use chi-square with ordinal data like Likert scales?

Technically yes, but chi-square ignores the ordering of categories. For ordinal data, the Mantel-Haenszel test for linear association or Mann-Whitney U test makes better use of the ordinal structure. Use chi-square for nominal variables where category order doesn't matter.

Q: What's the difference between chi-square and Fisher's exact test?

Both test independence between categorical variables. Chi-square uses an approximation that works well with adequate sample sizes. Fisher's exact test calculates the exact probability and is preferred when any expected cell frequency falls below 5 or total sample size is under 20. Most software runs both and flags when Fisher's test is more appropriate.

Learn how to apply the chi-square test to survey data, interpret results, check assumptions, and use chi-square for cross-tabulation significance testing.

What Is the Chi-Square Test Applied to Survey Data?

The chi-square test of independence is a statistical test that determines whether two categorical variables from a survey are associated or independent. When you cross-tabulate a survey question by a grouping variable (satisfaction level by customer segment, brand preference by age group, purchase intent by geographic region), the chi-square test tells you whether the observed differences in response patterns are statistically meaningful or could have occurred by random sampling variation. It compares the frequencies you actually observed in each cell of the crosstab to the frequencies you'd expect if the two variables had no relationship at all. If the discrepancy between observed and expected frequencies is large enough, the test returns a significant result, and you can conclude the variables are associated.

Why the Chi-Square Test Matters for Survey Research

Cross-tabulation is the most common survey analysis technique, and every crosstab implicitly raises the question: "Is this difference real?" When 68% of urban respondents prefer Brand A versus 59% of rural respondents, you need to know whether that 9-point gap reflects a genuine urban-rural difference or could easily appear by chance in a sample of this size. The chi-square test provides that answer. Without it, teams either over-interpret noise (redesigning a campaign because of a random blip) or under-appreciate real signals (ignoring a genuine segment difference because it "doesn't look that big").

How to Apply the Chi-Square Test to Survey Data

Setting Up the Test

You need two categorical variables and a count of respondents in each combination. The chi-square test works with the raw frequencies, not percentages. Your null hypothesis is that the two variables are independent (no association). The alternative hypothesis is that they're associated.

Consider a survey of 500 customers asking about support channel preference (phone, chat, email) crossed by age group (under 35, 35-54, 55+):

	Under 35	35-54	55+	Total
Phone	30	52	68	150
Chat	85	60	25	170
Email	55	58	67	180
Total	170	170	160	500

Calculating Expected Frequencies

For each cell, the expected frequency equals (row total x column total) / grand total. For the "Under 35, Phone" cell: (150 x 170) / 500 = 51. This is how many people you'd expect in that cell if age and channel preference were completely unrelated. Compare this to the observed count of 30. That's a substantial gap, suggesting younger respondents avoid phone support.

Computing the Test Statistic

The chi-square statistic sums the squared difference between observed and expected frequencies, divided by the expected frequency, across all cells: X2 = sum of (O - E)2 / E. With (rows - 1) x (columns - 1) = 2 x 2 = 4 degrees of freedom, you compare the test statistic to the chi-square distribution. For our example, X2 = 46.8, which is significant at p < 0.001. The conclusion: support channel preference is significantly associated with age group.

Interpreting the Results

A significant chi-square tells you the variables are associated but doesn't tell you where the association is strongest. Examine the adjusted standardized residuals for each cell. Residuals above +1.96 or below -1.96 indicate cells that are significantly higher or lower than expected. In our example, the "Under 35, Chat" cell would show a large positive residual (many more young chat users than expected), while "Under 35, Phone" would show a large negative residual. This pinpoints the specific group-by-category combinations driving the overall association.

Effect Size

Statistical significance depends on sample size. With 5,000 respondents, even trivial differences become significant. Cramer's V measures the strength of the association independent of sample size. V ranges from 0 (no association) to 1 (perfect association). Values below 0.10 are negligible, 0.10-0.30 are small to medium, and above 0.30 are strong. Always report Cramer's V alongside the chi-square result to distinguish between statistically significant and practically meaningful findings.

A Worked Example

A hotel chain surveyed 720 guests about their preferred booking method (direct website, OTA, phone, travel agent) crossed by travel purpose (business, leisure, bleisure). The chi-square test was significant: X2(6) = 38.4, p < 0.001, Cramer's V = 0.16. Adjusted residuals showed that business travelers booked directly at significantly higher rates than expected (+3.2), while leisure travelers used OTAs at significantly higher rates (+2.8). Phone booking was significantly associated with the 55+ age group across all travel purposes. The hotel used these findings to allocate digital marketing budget: direct booking campaigns targeted business travelers, while OTA optimization focused on leisure-oriented search terms.

When to Use the Chi-Square Test

Cross-tabulation significance testing whenever you compare response distributions across two or more groups on a categorical variable
Survey question association testing whether answers to one question predict answers to another (does satisfaction level relate to repurchase intent category?)
Segment profiling determining whether behavioral or attitudinal segments differ significantly on demographic characteristics
A/B testing with categorical outcomes comparing conversion rates, preference selections, or choice frequencies across experimental conditions
Pre/post comparison testing whether the distribution of responses changed significantly between two survey waves

Common Mistakes

Using chi-square when expected cell frequencies fall below 5 violates a key assumption and produces unreliable p-values; combine categories or use Fisher's exact test for small samples
Applying chi-square to continuous data without first categorizing it, or running it on percentages instead of raw counts, which produces incorrect results
Reporting significance without effect size which is especially misleading with large samples where trivially small differences achieve p < 0.001

How Quali-Fi Supports Chi-Square Testing

Quali-Fi's cross-tabulation tools include automatic chi-square testing with p-values and significance markers on every crosstab cell. The platform highlights cells with statistically significant over- or under-representation, so you can spot meaningful group differences at a glance without manual calculation.

Frequently Asked Questions

What sample size do I need for chi-square?

The requirement is about expected cell frequencies, not total sample size. Each cell in your crosstab should have an expected frequency of at least 5. For a 3x3 table, this typically means 45-90+ total respondents depending on how evenly distributed your categories are. Uneven distributions require larger samples.

Can I use chi-square with ordinal data like Likert scales?

Technically yes, but chi-square ignores the ordering of categories. For ordinal data, the Mantel-Haenszel test for linear association or Mann-Whitney U test makes better use of the ordinal structure. Use chi-square for nominal variables where category order doesn't matter.

What's the difference between chi-square and Fisher's exact test?

Both test independence between categorical variables. Chi-square uses an approximation that works well with adequate sample sizes. Fisher's exact test calculates the exact probability and is preferred when any expected cell frequency falls below 5 or total sample size is under 20. Most software runs both and flags when Fisher's test is more appropriate.

Test for significant differences in your survey data -- try Quali-Fi free for 14 days.

What Is the Chi-Square Test Applied to Survey Data?

Why the Chi-Square Test Matters for Survey Research

How to Apply the Chi-Square Test to Survey Data

Setting Up the Test

Calculating Expected Frequencies

Computing the Test Statistic

Interpreting the Results

Effect Size

A Worked Example

When to Use the Chi-Square Test

Common Mistakes

How Quali-Fi Supports Chi-Square Testing

Frequently Asked Questions

What sample size do I need for chi-square?

Can I use chi-square with ordinal data like Likert scales?

What's the difference between chi-square and Fisher's exact test?

Frequently Asked Questions

Related Guides

Cross-Tabulation Analysis: Applied Walkthrough

T-Test Applied to Survey Data: Walkthrough

ANOVA Applied to Survey Data: Walkthrough

Sample Size Formula: Detailed Walkthrough With Examples

Nominal vs. Ordinal: What It Is and How to Use It in Research

Ready to apply this in your research?

Chi-Square Test Applied to Survey Data: Walkthrough

What Is the Chi-Square Test Applied to Survey Data?

Why the Chi-Square Test Matters for Survey Research

How to Apply the Chi-Square Test to Survey Data

Setting Up the Test

Calculating Expected Frequencies

Computing the Test Statistic

Interpreting the Results

Effect Size

A Worked Example

When to Use the Chi-Square Test

Common Mistakes

How Quali-Fi Supports Chi-Square Testing

Frequently Asked Questions

What sample size do I need for chi-square?

Can I use chi-square with ordinal data like Likert scales?

What's the difference between chi-square and Fisher's exact test?

Related Topics

Frequently Asked Questions

Related Guides

Cross-Tabulation Analysis: Applied Walkthrough

T-Test Applied to Survey Data: Walkthrough

ANOVA Applied to Survey Data: Walkthrough

Sample Size Formula: Detailed Walkthrough With Examples

Nominal vs. Ordinal: What It Is and How to Use It in Research

Ready to apply this in your research?