JobBERT: Understanding Job Titles through Skills
- URL: http://arxiv.org/abs/2109.09605v1
- Date: Mon, 20 Sep 2021 15:00:10 GMT
- Title: JobBERT: Understanding Job Titles through Skills
- Authors: Jens-Joris Decorte, Jeroen Van Hautte, Thomas Demeester, Chris
Develder
- Abstract summary: Job titles form a cornerstone of today's human resources (HR) processes.
Job titles are a compact, convenient, and readily available data source.
We propose a neural representation model for job titles, by augmenting a pre-trained language model with co-occurrence information from skill labels extracted from vacancies.
- Score: 12.569546741576515
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Job titles form a cornerstone of today's human resources (HR) processes.
Within online recruitment, they allow candidates to understand the contents of
a vacancy at a glance, while internal HR departments use them to organize and
structure many of their processes. As job titles are a compact, convenient, and
readily available data source, modeling them with high accuracy can greatly
benefit many HR tech applications. In this paper, we propose a neural
representation model for job titles, by augmenting a pre-trained language model
with co-occurrence information from skill labels extracted from vacancies. Our
JobBERT method leads to considerable improvements compared to using generic
sentence encoders, for the task of job title normalization, for which we
release a new evaluation benchmark.
Related papers
- TAROT: A Hierarchical Framework with Multitask Co-Pretraining on
Semi-Structured Data towards Effective Person-Job Fit [60.31175803899285]
We propose TAROT, a hierarchical multitask co-pretraining framework, to better utilize structural and semantic information for informative text embeddings.
TAROT targets semi-structured text in profiles and jobs, and it is co-pretained with multi-grained pretraining tasks to constrain the acquired semantic information at each level.
arXiv Detail & Related papers (2024-01-15T07:57:58Z) - Distilling Large Language Models using Skill-Occupation Graph Context
for HR-Related Tasks [8.235367170516769]
We introduce the Resume-Job Description Benchmark (RJDB) to cater to a wide array of HR tasks.
Our benchmark includes over 50 thousand triples of job descriptions, matched resumes and unmatched resumes.
Our experiments reveal that the student models achieve near/better performance than the teacher model (GPT-4), affirming the effectiveness of the benchmark.
arXiv Detail & Related papers (2023-11-10T20:25:42Z) - SkillRec: A Data-Driven Approach to Job Skill Recommendation for Career
Insights [0.3121997724420106]
SkillRec collects and identifies the skill set required for a job based on the job descriptions published by companies hiring for these roles.
Based on our preliminary experiments on a dataset of 6,000 job titles and descriptions, SkillRec shows a promising performance in terms of accuracy and F1-score.
arXiv Detail & Related papers (2023-02-20T12:07:57Z) - A practical method for occupational skills detection in Vietnamese job
listings [0.16114012813668932]
Lack of accurate and timely labor market information leads to skill miss-matches.
Traditional approaches rely on existing taxonomy and/or large annotated data.
We propose a practical methodology for skill detection in Vietnamese job listings.
arXiv Detail & Related papers (2022-10-26T10:23:18Z) - KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in
Few-Shot NLP [68.43279384561352]
Existing data augmentation algorithms leverage task-independent rules or fine-tune general-purpose pre-trained language models.
These methods have trivial task-specific knowledge and are limited to yielding low-quality synthetic data for weak baselines in simple tasks.
We propose the Knowledge Mixture Data Augmentation Model (KnowDA): an encoder-decoder LM pretrained on a mixture of diverse NLP tasks.
arXiv Detail & Related papers (2022-06-21T11:34:02Z) - Predicting Job Titles from Job Descriptions with Multi-label Text
Classification [0.0]
We propose the multi-label classification approach for predicting relevant job titles from job description texts.
We implement the Bi-GRU-LSTM-CNN with different pre-trained language models to apply for the job titles prediction problem.
arXiv Detail & Related papers (2021-12-21T09:31:03Z) - XtremeDistilTransformers: Task Transfer for Task-agnostic Distillation [80.18830380517753]
We develop a new task-agnostic distillation framework XtremeDistilTransformers.
We study the transferability of several source tasks, augmentation resources and model architecture for distillation.
arXiv Detail & Related papers (2021-06-08T17:49:33Z) - Reinforced Iterative Knowledge Distillation for Cross-Lingual Named
Entity Recognition [54.92161571089808]
Cross-lingual NER transfers knowledge from rich-resource language to languages with low resources.
Existing cross-lingual NER methods do not make good use of rich unlabeled data in target languages.
We develop a novel approach based on the ideas of semi-supervised learning and reinforcement learning.
arXiv Detail & Related papers (2021-06-01T05:46:22Z) - DAGA: Data Augmentation with a Generation Approach for Low-resource
Tagging Tasks [88.62288327934499]
We propose a novel augmentation method with language models trained on the linearized labeled sentences.
Our method is applicable to both supervised and semi-supervised settings.
arXiv Detail & Related papers (2020-11-03T07:49:15Z) - Job2Vec: Job Title Benchmarking with Collective Multi-View
Representation Learning [51.34011135329063]
Job Title Benchmarking (JTB) aims at matching job titles with similar expertise levels across various companies.
Traditional JTB approaches mainly rely on manual market surveys, which is expensive and labor-intensive.
We reformulate the JTB as the task of link prediction over the Job-Graph that matched job titles should have links.
arXiv Detail & Related papers (2020-09-16T02:33:32Z) - Leveraging the Inherent Hierarchy of Vacancy Titles for Automated Job
Ontology Expansion [0.0]
We introduce a novel, purely data-driven approach towards the detection of new job titles.
Our method is conceptually simple, extremely efficient and competitive with traditional NER-based approaches.
arXiv Detail & Related papers (2020-04-06T16:55:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.