
The Gottman Four Horsemen in virtual sessions: intervening at the exact moment
Criticism, contempt, defensiveness, and stonewalling are the four patterns that best predict the dissolution of a relationship. Detecting them live during a virtual session gives the couples therapist an intervention window the human eye alone cannot always reach.
John and Julie Gottman spent four decades on a single empirical question: which interaction patterns best predict the separation of a couple? The Love Lab at the University of Washington recorded thousands of hours of couple conversations and, with longitudinal follow-up, identified four behaviors whose sustained presence predicts dissolution with very high accuracy — hit rates around 90 percent in research samples.
They called them The Four Horsemen of the Apocalypse, a loaded but clinically useful metaphor:
- Criticism — attacking the person's character rather than a specific behavior.
- Contempt — moral superiority, sarcasm, mockery, derisive language. The most lethal of the four.
- Defensiveness — returning fire without accepting responsibility, playing victim.
- Stonewalling — disconnection, silence, emotional withdrawal inside the conversation.
A couples therapist trained in the Gottman Method knows how to identify them. The operational problem is different: identifying them in real time, while two people are talking, sometimes interrupting each other, sometimes in different languages, all on a Zoom or Meet screen, is exhausting.
What the human eye misses
In a typical 60-minute couples session, there are approximately 400 to 600 conversational turns. If the Four Horsemen appear in 15 percent of those turns — a reasonable share for distressed couples arriving in therapy — that is 60 to 90 instances per session.
No therapist can cognitively flag 90 instances in an hour and sustain empathic listening and decide when to intervene and track the relational dynamic and model regulation. Some are noticed, others are missed, and the couple leaves with an unnamed pattern.
Our internal testing with practicing therapists — with their explicit consent and de-identified transcripts — suggests that on average a therapist consciously identifies between 30 and 50 percent of the Horsemen instances that occur in a recorded session. It is not a lack of skill: it is attentional bandwidth.
What the co-pilot does
CauceOS listens to the session live, identifies each speaker separately, and applies specialized models trained to detect the four patterns in conversational language. For every turn where a Horseman appears, the system:
- Tags it with the category and sub-type (for example: "Contempt — directed sarcasm").
- Associates it with the speaker who produced it.
- Displays it discreetly in the therapist's side panel, without interrupting the session or the screen shared with the couple.
- Accumulates a count per session and per partner.
At the end of the session, the therapist has a clear view: how many instances of each Horseman appeared, from each partner, and at what moments. That information is direct material for case conceptualization and for psychoeducation with the couple in the next session.
Three concrete uses
1. Intervention in the moment
When one partner enters stonewalling — crosses their arms, stops looking at the screen, answers in monosyllables — the therapist can notice it. But sometimes stonewalling is subtle: the person keeps talking, but their content empties, their sentences shorten, their affect flattens. The co-pilot flags it before the therapist confirms it with their own attention. It gives the clinician two or three extra seconds to decide whether to name the pattern ("I notice you're withdrawing") or let it run.
2. Evidence-based psychoeducation
One of the difficulties of Gottman work is that couples, in session, deny doing what they are doing. "I don't criticize him." "He doesn't show me contempt." Showing concrete data — "in the last 20 minutes there were 7 instances labeled as criticism, all of them from you toward your partner" — turns a clinical claim into a shared observation. The couple stops arguing about whether the pattern exists and starts working on the pattern.
3. Longitudinal tracking
Across 10 to 15 sessions of treatment, the co-pilot accumulates the frequency of each Horseman per partner. If treatment is working, that chart should go down. If it does not, adjustments are needed. This turns the clinical sense ("I feel they are doing better") into an indicator comparable to those already used by validated marital satisfaction scales.
What we do not do
Some important clarifications:
- We do not interpret on behalf of the therapist. An instance of "defensiveness" may be, in context, a legitimate response to an unfair criticism. Only the clinician decides what to do with the information.
- We do not replace the validated scales of the Gottman Method (SPAFF, Couples Therapy Inventory, etc.). The co-pilot complements; it does not substitute.
- We do not record without explicit consent from both partners. The bot announces its presence when joining the session, and the recording is subject to the retention policy the therapist configures.
- We do not diagnose the relationship. We do not issue a verdict of "this couple will separate". That would be irresponsible and empirically incorrect: the Four Horsemen predict risk, not destiny.
A note on the Gottmans' work
What we are building rests on four decades of rigorous research that is not ours. The intellectual and empirical credit belongs to John and Julie Gottman, the Gottman Institute, and hundreds of researchers who have replicated, extended, and nuanced their findings.
Our contribution is operational: making the clinical observation the method already prescribes viable in real virtual sessions, with the bandwidth a human therapist actually has in daily practice. If the method works better with systematic observation, and systematic observation is hard to sustain live, then there is an implementation problem technology can solve — without touching the method itself.
For therapists trained in another method
If you work with EFT (Emotionally Focused Therapy), IBCT, IFS, or other frameworks, the co-pilot's categories can be configured to your model. The Four Horsemen is our most mature use case because the Gottmans' work offers the clearest operational definitions, but the architecture is extensible. If your framework has patterns detectable in language, we can detect them.
If you work with couples and systematic observation is part of your practice, we want to talk to you. The product is being tuned with your kind of feedback.
More in this category
PsychologyCauceOS · Newsletter
Get the next notes straight to your inbox
Reflections, practices, and updates from CauceOS. No spam. Unsubscribe anytime.
Keep reading
Psychologycrisis detection
Early detection of crisis signals in therapy: how we assist the clinician without replacing them
How the co-pilot identifies, in real time, language associated with suicidal ideation, self-harm, and domestic violence — and why latency matters as much as sensitivity.
Psychologydiarization
Voice diarization: how we know who speaks in couples sessions without getting it wrong
In couples therapy, misattributing who said what makes any report useless. The technical challenge of identifying two voices on a video call — and why it matters.
Psychologytherapeutic modalities
Therapeutic modalities in LATAM: what we discovered talking to 40 psychologists
We spoke with 40 psychologists from five Spanish-speaking countries to understand which therapeutic frameworks they actually use. What we found changed how we designed CauceOS.