Career Path Prediction using Resume Representation Learning and
Skill-based Matching
- URL: http://arxiv.org/abs/2310.15636v1
- Date: Tue, 24 Oct 2023 08:56:06 GMT
- Title: Career Path Prediction using Resume Representation Learning and
Skill-based Matching
- Authors: Jens-Joris Decorte, Jeroen Van Hautte, Johannes Deleu, Chris Develder
and Thomas Demeester
- Abstract summary: We present a novel representation learning approach, CareerBERT, specifically designed for work history data.
We develop a skill-based model and a text-based model for career path prediction, which achieve 35.24% and 39.61% recall@10 respectively.
- Score: 14.635764829230398
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The impact of person-job fit on job satisfaction and performance is widely
acknowledged, which highlights the importance of providing workers with next
steps at the right time in their career. This task of predicting the next step
in a career is known as career path prediction, and has diverse applications
such as turnover prevention and internal job mobility. Existing methods to
career path prediction rely on large amounts of private career history data to
model the interactions between job titles and companies. We propose leveraging
the unexplored textual descriptions that are part of work experience sections
in resumes. We introduce a structured dataset of 2,164 anonymized career
histories, annotated with ESCO occupation labels. Based on this dataset, we
present a novel representation learning approach, CareerBERT, specifically
designed for work history data. We develop a skill-based model and a text-based
model for career path prediction, which achieve 35.24% and 39.61% recall@10
respectively on our dataset. Finally, we show that both approaches are
complementary as a hybrid approach achieves the strongest result with 43.01%
recall@10.
Related papers
- Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training [51.60874286674908]
We focus on predicting performance on Closed-book Question Answering (CBQA) tasks, which are closely tied to pre-training data and knowledge retention.
We address three major challenges: 1) mastering the entire pre-training process, especially data construction; 2) evaluating a model's knowledge retention; and 3) predicting task-specific knowledge retention using only information available prior to training.
We introduce the SMI metric, an information-theoretic measure that quantifies the relationship between pre-training data, model size, and task-specific knowledge retention.
arXiv Detail & Related papers (2025-02-06T13:23:53Z) - KARRIEREWEGE: A Large Scale Career Path Prediction Dataset [29.24421465266904]
We introduce KARRIEREWEGE, a comprehensive, publicly available dataset containing over 500k career paths.
To tackle the problem of free-text inputs typically found in resumes, we enhance it by synthesizing job titles and descriptions.
This allows for accurate predictions from unstructured data, closely aligning with real-world application challenges.
arXiv Detail & Related papers (2024-12-19T08:02:08Z) - Human Action Anticipation: A Survey [86.415721659234]
The literature on behavior prediction spans various tasks, including action anticipation, activity forecasting, intent prediction, goal prediction, and so on.
Our survey aims to tie together this fragmented literature, covering recent technical innovations as well as the development of new large-scale datasets for model training and evaluation.
arXiv Detail & Related papers (2024-10-17T21:37:40Z) - CAPER: Enhancing Career Trajectory Prediction using Temporal Knowledge Graph and Ternary Relationship [23.845300607433792]
We propose a novel solution, named as CAPER, that solves the challenges via sophisticated temporal knowledge graph (TKG) modeling.
It enables the utilization of a graph-structured knowledge base with rich expressiveness, effectively preserving the changes in job movement patterns.
Experiments on a real-world career trajectory dataset demonstrate that CAPER consistently and significantly outperforms four baselines.
arXiv Detail & Related papers (2024-08-28T08:21:56Z) - Job-SDF: A Multi-Granularity Dataset for Job Skill Demand Forecasting and Benchmarking [59.87055275344965]
Job-SDF is a dataset designed to train and benchmark job-skill demand forecasting models.
Based on 10.35 million public job advertisements collected from major online recruitment platforms in China between 2021 and 2023.
Our dataset uniquely enables evaluating skill demand forecasting models at various granularities, including occupation, company, and regional levels.
arXiv Detail & Related papers (2024-06-17T07:22:51Z) - Unlocking Futures: A Natural Language Driven Career Prediction System for Computer Science and Software Engineering Students [0.5735035463793009]
This study contributes valuable insights to educational advising by providing specific career suggestions based on the unique features of CS and SWE students.
The research helps individual CS and SWE students find suitable jobs that match their skills, interests, and skill-related activities.
arXiv Detail & Related papers (2024-05-28T12:56:57Z) - TAROT: A Hierarchical Framework with Multitask Co-Pretraining on
Semi-Structured Data towards Effective Person-Job Fit [60.31175803899285]
We propose TAROT, a hierarchical multitask co-pretraining framework, to better utilize structural and semantic information for informative text embeddings.
TAROT targets semi-structured text in profiles and jobs, and it is co-pretained with multi-grained pretraining tasks to constrain the acquired semantic information at each level.
arXiv Detail & Related papers (2024-01-15T07:57:58Z) - Professional Network Matters: Connections Empower Person-Job Fit [62.20651880558674]
This paper emphasizes the importance of incorporating professional networks into the Person-Job Fit model.
We introduce a job-specific attention mechanism in CSAGNN to handle noisy professional networks.
We demonstrate the effectiveness of our approach through experimental evaluations conducted across three real-world recruitment datasets from LinkedIn.
arXiv Detail & Related papers (2023-12-19T06:44:44Z) - CAREER: A Foundation Model for Labor Sequence Data [21.38386300423882]
We develop CAREER, a foundation model for job sequences.
CAREER is first fit to large, passively-collected resume data, then fine-tuned to smaller, better-curated datasets for economic inferences.
We find that CAREER forms accurate predictions of job sequences, outperforming econometric baselines on three widely-used economics datasets.
arXiv Detail & Related papers (2022-02-16T23:23:50Z) - Improving Next-Application Prediction with Deep Personalized-Attention
Neural Network [27.71640897308797]
We propose to leverage next-item recommendation approaches to consider better the job seeker's career preference.
Our proposed model, named Personalized-Attention Next-Application Prediction (PANAP), is composed of three modules.
Experiments on the public CareerBuilder12 dataset show the interest in our approach.
arXiv Detail & Related papers (2021-11-09T10:52:57Z) - You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory
Prediction [52.442129609979794]
Recent deep learning approaches for trajectory prediction show promising performance.
It remains unclear which features such black-box models actually learn to use for making predictions.
This paper proposes a procedure that quantifies the contributions of different cues to model performance.
arXiv Detail & Related papers (2021-10-11T14:24:15Z) - Job2Vec: Job Title Benchmarking with Collective Multi-View
Representation Learning [51.34011135329063]
Job Title Benchmarking (JTB) aims at matching job titles with similar expertise levels across various companies.
Traditional JTB approaches mainly rely on manual market surveys, which is expensive and labor-intensive.
We reformulate the JTB as the task of link prediction over the Job-Graph that matched job titles should have links.
arXiv Detail & Related papers (2020-09-16T02:33:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.