Student Feedback Should Inform Staffing, But Never Decide It Alone

April 10, 2026 by Sachit Lakkaraju

Student feedback matters. It can reveal problems that administrators may not see from a timetable, a workload report, or a staff profile: unclear communication, inconsistent delivery, weak organisation, assessment confusion, or a lecturer who is not well matched to a particular subject. For higher education institutions, ignoring student feedback would be a mistake.

But using student feedback as the main driver of staff allocation would be a different mistake. The research literature is clear on this point: student evaluations of teaching can be useful, but they are not a complete, neutral, or standalone measure of teaching quality. Once feedback starts influencing future staffing decisions, it needs to be handled as one signal inside a broader decision-support process, not as a ranking table where the highest score automatically wins.

That distinction matters. A feedback score may look objective because it is numeric, but the number is shaped by context: class size, subject difficulty, delivery mode, response rate, student expectations, grading pressure, lecturer identity, and whether the students who responded are representative of the whole class.

1. Student Feedback Is Useful, But It Measures Experience First

Student evaluations are often treated as if they directly measure teaching quality. In practice, they are better understood as evidence about student experience. That experience is important, but it is not the same thing as a complete assessment of teaching effectiveness.

They Capture What Students Notice: Students are well placed to comment on communication, clarity, organisation, responsiveness, and whether the learning experience felt coherent. These are real teaching signals and should not be dismissed.
They Do Not Capture Everything: Students may not be able to judge curriculum design, long-term learning impact, assessment validity, disciplinary rigour, or whether a difficult subject was taught well despite lower satisfaction scores.
They Need Interpretation: Spooren, Brockx, and Mortelmans, in their Review of Educational Research overview of student evaluation literature, describe student evaluations as a valuable source of information, but not sufficient by themselves for evaluating teaching.

This is the first reason feedback should inform staffing rather than decide it. It gives administrators useful evidence, but that evidence needs to be read alongside other operational and academic factors.

2. Higher Scores Do Not Automatically Mean Better Learning

A common assumption is that higher student ratings mean better teaching outcomes. That assumption is weaker than it appears. Bob Uttl, Carmela White, and Daniela Gonzalez’s meta-analysis in Studies in Educational Evaluation found no significant relationship between student evaluation ratings and student learning in multisection studies.

Popularity Is Not the Same as Effectiveness: A lecturer may receive strong ratings because they are clear, supportive, and organised. But ratings may also be influenced by grading expectations, workload difficulty, personality, or whether the subject itself is enjoyable.
Difficult Subjects Can Be Penalised: A lecturer teaching a demanding quantitative, technical, or compulsory subject may face lower satisfaction even if the teaching is strong. If allocation systems reward raw satisfaction scores, institutions risk discouraging rigorous teaching.
Context Matters: A feedback score without class size, response rate, teaching mode, subject difficulty, and historical comparison can be misleading. A 4.1 average in one context may be more impressive than a 4.7 in another.

This does not make feedback useless. It means feedback should be treated as a contextual signal, not as a universal scoreboard.

3. Feedback Scores Can Contain Bias

The fairness problem becomes serious when student feedback is used in staff-related decisions. Anne Boring’s research in the Journal of Public Economics found evidence of gender bias in student evaluations of teaching, including patterns where male students rated male professors more favourably. Her study also found no evidence that male professors were better instructors, which makes the rating difference especially important.

Bias Can Look Like Performance: If a biased rating enters a spreadsheet or ranking system, it may appear as a neutral performance metric. The danger is that unfairness becomes hidden behind a number.
Small Design Choices Matter: Peterson and Boring’s PLOS ONE study found that anti-bias language shown before evaluations improved rankings of female instructors, suggesting that survey design can influence bias. That is useful, but it also proves the deeper point: evaluation scores are shaped by process design, not just teaching quality.
Bias Can Compound Through Ranking: Once feedback feeds into a ranked recommendation, the risk grows. A small unfair difference in survey score can push one lecturer above another, especially if the system treats feedback as a dominant criterion.

For institutions, the lesson is not to throw feedback away. The lesson is to design systems that prevent feedback from becoming an unchecked proxy for lecturer quality.

4. Staffing Decisions Need Multiple Signals

Academic allocation is a multi-criteria decision. A good staffing decision usually balances hard constraints, staff preferences, institutional needs, workload fairness, subject fit, student experience, and administrator judgement. No single input should dominate the entire process.

Hard Constraints First: Availability, timetable clashes, employment status, required qualifications, and campus constraints should determine whether an allocation is even possible.
Soft Criteria Second: Subject preference, location preference, prior teaching history, continuity, workload balance, and carefully interpreted feedback can then help compare valid options.
Feedback as One Signal: Student feedback should sit beside other criteria. It should help flag patterns, strengths, and risks, but it should not automatically determine who gets assigned to a class.

This is where ranking systems need care. Fairness researchers such as Zehlike, Yang, and Stoyanovich describe ranking fairness as a distinct problem because ranked systems affect exposure, opportunity, and outcomes. In staffing, that means a recommendation list is not neutral simply because it is ordered by a formula. The formula decides which lecturers are considered most visible, most suitable, and most likely to be chosen.

5. The Right Model Is Explainable, Reviewable, and Overrideable

A responsible staff allocation system should not hide behind an algorithm. If feedback influences a recommendation, administrators should be able to see how much it mattered, what other factors were considered, and whether the outcome should be accepted, rejected, or overridden.

Score Breakdowns: Instead of showing only a final rank, the system should show the contribution of factors such as availability match, subject preference, workload, location fit, and feedback trend.
Reason Codes: Human-readable explanations such as “strong availability match,” “preferred subject,” “workload concern,” or “feedback trend requires review” help administrators understand the recommendation rather than blindly accept it.
Human Override: Microsoft’s human-AI interaction guidelines emphasise that AI systems should support efficient correction when they are wrong. In staff allocation, that means administrators must be able to override a recommendation and record the reason.

The OECD AI Principles make a similar point at the governance level: AI systems should provide meaningful information so people can understand and challenge outcomes. That principle is especially relevant when recommendations affect staff workloads, teaching opportunities, and institutional decision-making.

Conclusion

Student feedback should absolutely inform academic staffing. It provides a valuable view of the learning experience and can help institutions identify strengths, risks, and improvement areas. But it should never be treated as a standalone measure of lecturer quality or used as the sole basis for staff allocation.

The better approach is multi-factor decision support: combine feedback with availability, subject fit, staff preferences, workload balance, institutional constraints, and human judgement. This is where platforms like StaffSense make sense. By treating student feedback as one carefully interpreted signal within a transparent recommendation process, StaffSense avoids the false simplicity of “highest rating wins” and supports staffing decisions that are more explainable, fair, and reviewable.