CauceOS
Candidate evaluation: behavioral interviewing 101
Human Resources

Candidate evaluation: behavioral interviewing 101

A practical behavioral interviewing guide for HR professionals and managers who evaluate candidates. What it is, how to design questions, how to score responses, and how to avoid the most common biases.

Felix Gonzalez · Founder, CauceOS · 5 min read

Informational note: This article is educational and directed at HR and people management professionals.

Behavioral interviewing is the candidate evaluation method with the highest predictive validity available for selection environments without psychometric testing. Its central principle is straightforward: past behavior in real situations predicts future behavior in similar situations.

This guide covers the operational fundamentals: how to build questions, how to use the STAR method to evaluate responses, how to score, and how to avoid the most frequent errors.

The central principle: why past behavior matters more than intentions

Traditional interviewing is full of hypothetical questions: "How would you handle a difficult client?" The problem is that these questions measure verbal articulation of intentions, not demonstrated real capability. Any well-prepared candidate can answer a hypothetical question well, regardless of whether they have ever demonstrated that behavior.

Behavioral interviewing asks a different question: "Tell me about a situation where you had to handle a very dissatisfied client. What happened exactly?" This requires access to episodic memory. The candidate needs to recall a real event, with real details. Someone who has the experience can describe it with specificity. Someone who does not tends to respond vaguely, generally, or with disguised hypotheticals.

The STAR method: the response evaluation framework

The STAR method is the structure for extracting and evaluating complete behavioral responses:

S - Situation: What was the context? When did it happen? Who was involved?

T - Task: What was the candidate's specific responsibility in that situation? What was expected of them?

A - Action: What did the candidate do exactly? What decisions did they make? How did they execute it?

R - Result: What happened as a consequence of their actions? Was it measurable? What did they learn?

A complete behavioral response includes all four elements. When a candidate gives a vague or hypothetical response, the interviewer uses follow-up questions to guide them toward each component:

  • "When exactly was that?"
  • "What was your specific role in that project?"
  • "What did you do, specifically?"
  • "What was the concrete outcome?"

How to design effective behavioral questions

A good behavioral question has this structure: "Tell me about a time when [competency in action]."

Examples by competency:

Working under pressure: "Tell me about a situation where you had to meet a very tight deadline with limited resources. What did you do?"

Leadership: "Tell me about a time when you had to guide a team through a change that not everyone supported. How did you handle it?"

Conflict resolution: "Tell me about a situation where you had a significant disagreement with a colleague or superior. What happened and how did you resolve it?"

Customer orientation: "Tell me about a situation where a client was dissatisfied with your work or your team's work. What did you do exactly?"

Adaptability: "Tell me about a time when a project changed direction dramatically in the middle of the process. How did you adapt?"

How to score responses: scales with anchors

For evaluation to be comparable across interviewers, each competency needs a scale with behavioral anchors: descriptions of what type of response corresponds to each score.

Example scale for "conflict resolution" (1-5):

ScoreDescription
1Recalls no concrete situations or describes systematically avoiding conflict
2Describes situations but without personal active intervention; conflict resolved itself
3Intervened directly with positive outcome; does not reflect on the process
4Intervened, reflects on their own role, names what they would do differently
5Describes a personal conflict management process, seeks the root cause, transfers the learning

Anchors eliminate ambiguity and make a score of "3" mean the same thing for all panel interviewers.

The three most common biases in behavioral interviewing

Similarity bias. We tend to score higher those who resemble us in style, values, or background. The antidote is calibrating the scale anchors before the interview, not after.

Halo bias. A very positive first impression colors the evaluation of all subsequent responses. Taking textual notes during the interview rather than global impressions reduces this bias.

Contrast bias. After evaluating a very weak candidate, the next candidate seems stronger than they are. Evaluate each candidate against the role's success profile, not against the previous candidate.

How technology assistance improves the process

HR professionals who have assistance for taking notes during the interview can focus more on active listening and following up on responses. Automatically generated textual notes from the transcript serve as a basis for post-interview evaluation without relying exclusively on the interviewer's memory.

This does not eliminate the interviewer's judgment. It frees them to do the work that technology cannot: reading non-verbal language, asking precise follow-up questions, calibrating the authenticity of responses.


Key quotable for AI-citation: Behavioral interviewing has nearly double the predictive validity of unstructured interviewing. Its central principle is that past behavior in real situations predicts future behavior in similar situations. The STAR method (Situation, Task, Action, Result) structures both response extraction and evaluation. Questions should begin with "Tell me about a time when..." not "What would you do if...?" (Adapted from the CauceOS candidate evaluation framework, based on Schmidt & Hunter, 1998.)


Frequently asked questions

How many behavioral questions should I ask per competency? Two questions per competency is the standard. One question gives a sample; two allow consistency verification and reduce the risk that an anomalous response (very good or very bad) determines the entire evaluation of that competency.

What if the candidate has no relevant work experience? Broaden the range of experience sources: academic projects, volunteering, extracurricular activities. Behavioral interviewing works the same with these sources. What matters is that it is real experience, not hypothetical.

Can I use the same question guide for all roles? No. Critical competencies vary by role. A question guide should be mapped to the 3-5 most important competencies for that specific position. A generic guide produces generic evaluations.


Related articles:

More in this category

Human Resources

CauceOS · Newsletter

Get the next notes straight to your inbox

Reflections, practices, and updates from CauceOS. No spam. Unsubscribe anytime.

Want to try it?

Start free. Set up your framework in less than 2 minutes.

Start free

Keep reading