Related papers: Beyond the Resumé: A Rubric-Aware Automatic Interview System for Information Elicitation

Beyond the Resumé: A Rubric-Aware Automatic Interview System for Information Elicitation

URL: http://arxiv.org/abs/2603.01775v1
Date: Mon, 02 Mar 2026 12:00:10 GMT
Title: Beyond the Resumé: A Rubric-Aware Automatic Interview System for Information Elicitation
Authors: Harry Stuart, Masahiro Kaneko, Timothy Baldwin,
Abstract summary: Large language models (LLMs) can play the role of subject matter experts to cost-effectively elicit information from each candidate.<n>We release code, a modest dataset of public-domain/anonymised resumes, belief calibration tests, and simulated interviews.
Score: 41.93085698478849
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Effective hiring is integral to the success of an organisation, but it is very challenging to find the most suitable candidates because expert evaluation (e.g.\ interviews conducted by a technical manager) are expensive to deploy at scale. Therefore, automated resume scoring and other applicant-screening methods are increasingly used to coarsely filter candidates, making decisions on limited information. We propose that large language models (LLMs) can play the role of subject matter experts to cost-effectively elicit information from each candidate that is nuanced and role-specific, thereby improving the quality of early-stage hiring decisions. We present a system that leverages an LLM interviewer to update belief over an applicant's rubric-oriented latent traits in a calibrated way. We evaluate our system on simulated interviews and show that belief converges towards the simulated applicants' artificially-constructed latent ability levels. We release code, a modest dataset of public-domain/anonymised resumes, belief calibration tests, and simulated interviews, at \href{https://github.com/mbzuai-nlp/beyond-the-resume}{https://github.com/mbzuai-nlp/beyond-the-resume}. Our demo is available at \href{https://btr.hstu.net}{https://btr.hstu.net}.

Related papers

SparkMe: Adaptive Semi-Structured Interviewing for Qualitative Insight Discovery [55.50580661343875]
We introduce SparkMe, a multi-agent interviewer that performs deliberative planning via simulated conversation rollouts to select questions with high expected utility.<n>We evaluate SparkMe through controlled experiments with LLM-based interviewees, showing that it achieves higher interview utility.<n>We further validate SparkMe in a user study with 70 participants across 7 professions on the impact of AI on their professions.
arXiv Detail & Related papers (2026-02-24T17:33:02Z)
SimInterview: Transforming Business Education through Large Language Model-Based Simulated Multilingual Interview Training System [1.6273083168563973]
This paper introduces SimInterview, a large language model (LLM)-based simulated multilingual interview training system.<n>We show that the system consistently aligns its assessments with job requirements, faithfully preserves resume content, and earns high satisfaction ratings.<n>We also outlined a contestable AI design that can explain, detect bias, and preserve human-in-the-loop to meet emerging regulatory expectations.
arXiv Detail & Related papers (2025-08-16T02:18:36Z)
AI Hiring with LLMs: A Context-Aware and Explainable Multi-Agent Framework for Resume Screening [12.845918958645676]
We propose a multi-agent framework for resume screening using Large Language Models (LLMs)<n>The framework consists of four core agents, including a resume extractor, an evaluator, a summarizer, and a score formatter.<n>This dynamic adaptation enables personalized recruitment, bridging the gap between AI automation and talent acquisition.
arXiv Detail & Related papers (2025-04-01T12:56:39Z)
Using Large Language Models to Develop Requirements Elicitation Skills [1.1473376666000734]
We propose conditioning a large language model to play the role of the client during a chat-based interview.<n>We find that both approaches provide sufficient information for participants to construct technically sound solutions.
arXiv Detail & Related papers (2025-03-10T19:27:38Z)
MockLLM: A Multi-Agent Behavior Collaboration Framework for Online Job Seeking and Recruiting [29.676163697160945]
We propose textbfMockLLM, a novel framework to generate and evaluate mock interview interactions.<n>By simulating both interviewer and candidate roles, MockLLM enables consistent and collaborative interactions for real-time and two-sided matching.<n>We evaluate MockLLM on real-world data Boss Zhipin, a major Chinese recruitment platform.
arXiv Detail & Related papers (2024-05-28T12:23:16Z)
CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval [52.134133938779776]
We present CLARINET, a system that asks informative clarification questions by choosing questions whose answers would maximize certainty in the correct candidate. Our approach works by augmenting a large language model (LLM) to condition on a retrieval distribution, finetuning end-to-end to generate the question that would have maximized the rank of the true candidate at each turn.
arXiv Detail & Related papers (2024-04-28T18:21:31Z)
EZInterviewer: To Improve Job Interview Performance with Mock Interview Generator [60.2099886983184]
EZInterviewer aims to learn from the online interview data and provides mock interview services to the job seekers. To address the low-resource challenge, EZInterviewer is trained on a very small set of interview dialogs.
arXiv Detail & Related papers (2023-01-03T07:00:30Z)
NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory [92.98552727430483]
Narrations-as-Queries (NaQ) is a data augmentation strategy that transforms standard video-text narrations into training data for a video query localization model. NaQ improves multiple top models by substantial margins (even doubling their accuracy) We also demonstrate unique properties of our approach such as the ability to perform zero-shot and few-shot NLQ, and improved performance on queries about long-tail object categories.
arXiv Detail & Related papers (2023-01-02T16:40:15Z)
Toward a traceable, explainable, and fairJD/Resume recommendation system [10.820022470618234]
Development of an automatic recruitment system is still one of the main challenges. Our aim is to explore how modern language models can be combined with knowledge bases and datasets to enhance the JD/Resume matching process.
arXiv Detail & Related papers (2022-02-02T18:17:05Z)
MS-Ranker: Accumulating Evidence from Potentially Correct Candidates for Answer Selection [59.95429407899612]
We propose a novel reinforcement learning based multi-step ranking model, named MS-Ranker. We explicitly consider the potential correctness of candidates and update the evidence with a gating mechanism. Our model significantly outperforms existing methods that do not rely on external resources.
arXiv Detail & Related papers (2020-10-10T10:36:58Z)

This list is automatically generated from the titles and abstracts of the papers in this site.