Severity bias is the mirror image of leniency. When managers rate harshly across the board, the damage to engagement and trust is equally serious. Here is how HR addresses it.
Marketing Lead
May 19, 2026
•
4 Mins Read
Leniency bias gets most of the attention in performance management literature. Its mirror image, severity bias or strictness bias, is equally damaging and often harder to detect because it feels, to the manager practising it, like rigour rather than error.
A manager who rates most of their team as below average, who applies a standard so demanding that almost nobody meets it, or who uses low ratings as a motivational tool, is producing review data that is just as distorted as the leniency inflater. The difference is that the employees receiving the ratings are also being demotivated, which has an immediate and visible cost.
This article explains how to identify severity bias, why it happens, and the structural and conversational interventions that correct it.
Severity bias is not just a measurement problem. It is an engagement crisis in a specific team:
The same calibration challenge question that addresses leniency addresses severity: "This employee is rated below expectations. What specific evidence from this cycle supports that rating?"
For a manager with severity bias, the evidence may be real but the standard applied is different from what the organisation intends. "Below expectations" in this manager's system means "not quite as good as they could theoretically be." In the organisation's system it means "not consistently meeting core role requirements."
Calibration makes that standard divergence visible and correctable. It is not a personal attack on the manager's judgment. It is alignment on what the rating words mean.
Show the manager their distribution alongside the distributions of peer managers in comparable functions. "Your team's distribution shows 65% below the midpoint. Comparable teams in operations and finance show 18-25% below the midpoint. Either your team is significantly underperforming relative to similar teams, or there may be a standard difference. Let us explore this together."
The comparison data is not accusatory. It is the starting point for a calibration conversation that the manager needs.
Severity bias is often a definition problem: the manager has a higher personal standard for what constitutes meeting expectations than the organisation has set. The conversation should return explicitly to the rating anchors: "Our definition of meets expectations is [specific anchor language]. Does your rating of this employee match that definition?"
If the manager believes they are using low ratings as motivation, address this explicitly: "Low ratings can motivate the people who have the most options to leave. They rarely motivate people to stay and improve, particularly for sustained periods. The research on this is consistent: accurate ratings tied to specific development plans produce better performance outcomes than systematically low ratings."
This is possible, and it requires a different conversation. If the evidence genuinely supports that the team's output is substantially below the organisational norm, the question is why: is it a management problem, a skills problem, a resource problem, or a design problem? Low ratings without a structured response to the underlying cause are not performance management. They are documentation without development.
Senior manager severity bias requires a conversation at the executive or board level, not just an HR calibration session. When a senior leader's team is being systematically underrated, the damage is compounded by the manager's influence over career decisions. The CEO or relevant C-suite leader needs to be brought into the calibration outcome and the standard alignment conversation.
Rating severity is not rigour. It is a misapplication of a high personal standard to an organisational measurement system. The employees affected experience it not as a high bar that inspires improvement but as an unfair environment that inspires exit.
HR's role is to ensure the rating standard is consistent across managers, which means addressing both inflation and severity with the same calibration rigour. A distribution that is uniformly low is just as problematic as one that is uniformly high.