Qualitative Methods

Theoretical Coding: What It Is and How to Build Theory from Qualitative Data

5 min read

Learn what theoretical coding is, how it specifies relationships between categories in grounded theory, and when to use theoretical coding to move from findings to theory.

What Is Theoretical Coding?

Theoretical coding is an advanced second-cycle qualitative coding method, originating in Barney Glaser's classic grounded theory, that specifies how substantive categories relate to each other within an emerging theoretical framework. After open coding (which Glaser calls substantive coding) has identified categories in the data, theoretical coding weaves those categories into a coherent theory by naming the type of relationship between them, causal, conditional, consequential, processual, or otherwise. Theoretical codes are conceptual connectors: they don't describe the data itself but describe how the pieces of your analysis fit together.

Why Theoretical Coding Matters

Categories alone don't constitute a theory. A study might identify trust erosion, information seeking, workaround development, and exit planning as major categories, but those are just a list until you specify how they connect. Theoretical coding supplies the "so what", it's the step that transforms a collection of categories into a theoretical model that explains how a phenomenon works. Without it, grounded theory analysis stops at categorization and never achieves its promise of theory generation.

How Theoretical Coding Works

Glaser's Coding Families

Glaser identified 18 theoretical coding families, pre-existing conceptual frameworks that researchers can use to think about how categories might relate. You don't force your data into these families; rather, you use them as sensitizing devices to recognize relationships in your data. Key families include:

The Six C's: causes, contexts, contingencies, consequences, covariances, conditions. This family asks: What causes the phenomenon? Under what conditions? With what consequences?

Process: stage, staging, phases, transitions, progressions. This family applies when your categories represent sequential steps.

Degree: intensity, range, extent, amount, level. This family applies when categories vary in magnitude across cases.

Type: type, form, kind, class, genre. This family organizes categories as varieties of a broader concept.

Strategy: strategies, tactics, mechanisms, techniques. This family applies when participants are actively managing, coping, or responding.

Interactive: interaction, mutual effects, interdependence, reciprocity. This family captures how actors influence each other.

The Process

Step 1: Achieve well-developed categories. Theoretical coding requires mature categories with clearly defined properties and dimensions. If your categories are still fuzzy, return to substantive coding or focused coding before attempting theoretical coding.

Step 2: Let theoretical codes emerge. Glaser insists that theoretical codes should emerge from the data, not be selected in advance. As you read your memos and review your categories, pay attention to the language you naturally use to describe relationships between them. If you keep writing "this leads to..." you may be working with a process family. If you write "under these conditions..." you're in the Six C's territory.

Step 3: Test the theoretical code. Once you've identified a potential theoretical relationship, check it against the data. Does the causal connection you've proposed hold across cases? Are there conditions under which it doesn't? Use negative case analysis to probe the boundaries.

Step 4: Integrate into a theoretical model. The theoretical code should help you write a coherent theoretical narrative, a grounded theory, that explains the central phenomenon of your study. This narrative connects your core category (identified during substantive coding) to other categories through the relationships specified by theoretical codes.

Example

In a study of freelancers managing income instability, substantive coding produces categories like diversifying income streams, building a financial buffer, lowering lifestyle expectations, maintaining professional identity, and client relationship management. Theoretical coding might reveal that these categories relate through a strategy framework, they're all tactics employed within a broader process of sustaining a freelance career under uncertainty. The theoretical code (strategy) specifies how the categories connect and places them within a coherent theoretical model.

When to Use Theoretical Coding

  • Classic grounded theory: theoretical coding is the standard method for moving from substantive categories to theory in Glaser's approach.
  • Any qualitative study aiming to build theory: when your goal extends beyond describing themes to explaining how a phenomenon works.
  • Mature analysis: theoretical coding happens late in the analytic process, after categories are well-developed and saturated.

Common Mistakes

  • Applying theoretical codes prematurely. If you impose a theoretical framework before your categories are fully developed, you'll build theory on shaky foundations. Theoretical coding requires patience, it comes after substantial substantive coding and memo writing.
  • Forcing a single coding family. Different relationships in your data may belong to different coding families. Your theory might involve both process relationships and causal relationships. Don't restrict yourself to one family when the data points to multiple types of connections.
  • Confusing theoretical codes with substantive codes. Substantive codes describe what's in the data. Theoretical codes describe how data-derived categories relate to each other. "Trust erosion" is a substantive code. "Trust erosion causes exit planning" includes a theoretical code (causal relationship). Keep the levels distinct.

Quali-Fi Support

Quali-Fi's AI-powered qualitative analysis handles the labor-intensive substantive coding that theoretical coding builds upon, generating initial codes from focus group transcripts, discussion boards, and open-ended responses. While the conceptual work of theoretical coding requires human analytic judgment, the platform's tools make it easier to develop and compare categories across large datasets, creating the foundation for theory building.

Build grounded theory with Quali-Fi's qualitative tools{:.cta-button }

FAQs

How is theoretical coding different from axial coding?

Axial coding (Strauss and Corbin) uses a specific paradigm model, conditions, context, strategies, consequences, to map relationships. Theoretical coding (Glaser) draws on a broader range of coding families and lets the theoretical framework emerge from the data rather than applying a predetermined paradigm. Both aim to specify relationships between categories, but they differ in structure and flexibility.

Do I need to know all 18 coding families?

No. Familiarity with the major families (Six C's, Process, Strategy, Type) gives you enough conceptual vocabulary to recognize relationships as they emerge. You don't select a family and apply it, you notice which family fits what you're already seeing in the data. The families are lenses, not templates.

Can theoretical coding be done without formal grounded theory?

Yes. Any qualitative researcher building explanatory models from data can use theoretical coding logic, specifying how categories relate causally, sequentially, or conditionally. You don't need to follow the full grounded theory methodology to benefit from thinking theoretically about the relationships between your findings.

Related Guides

Qualitative Methods

Qualitative Research Methods: A Complete Guide to Approaches, Coding, and Rigor

Learn the major qualitative research methods, coding techniques, and trustworthiness criteria used in market research, UX, and social science.

13 min readRead
Qualitative Methods

Qualitative Coding: What It Is and How to Code Qualitative Data

Learn what qualitative coding is, explore major coding approaches (open, axial, in vivo, and more), and understand how coding transforms raw data into research findings.

6 min readRead
Qualitative Methods

Selective Coding: What It Is and How to Use It in Grounded Theory

Learn what selective coding is, how to identify a core category in grounded theory, and when selective coding moves your analysis from categories to theory.

5 min readRead
Qualitative Methods

Focused Coding: What It Is and How to Refine Your Qualitative Analysis

Learn what focused coding is, how it selects the most productive initial codes for deeper analysis, and when focused coding sharpens your qualitative findings.

5 min readRead
Qualitative Methods

Axial Coding: What It Is and How to Use It in Qualitative Research

Learn what axial coding is, how it connects categories in grounded theory, and when to use axial coding in qualitative data analysis.

5 min readRead
AI in Research

AI-Powered Qualitative Analysis: What's Possible Today

What AI can and can't do in qualitative analysis. Practical guide to AI coding, theme detection, and sentiment analysis for research teams.

8 min readRead
Research Methodology

Mixed Methods Research: What It Is and How to Use It in Research

Mixed methods research combines qualitative and quantitative approaches in a single study. Learn the designs, benefits, and practical applications for research.

8 min readRead

Put it into practice

Ready to apply this in your research?

Quali-Fi makes it easy to run surveys, conjoint studies, and more, all in one platform.