Hierarchical Interdisciplinary Topic Detection Model for Research
Proposal Classification
- URL: http://arxiv.org/abs/2209.13519v1
- Date: Fri, 16 Sep 2022 16:59:25 GMT
- Title: Hierarchical Interdisciplinary Topic Detection Model for Research
Proposal Classification
- Authors: Meng Xiao, Ziyue Qiao, Yanjie Fu, Hao Dong, Yi Du, Pengyang Wang, Hui
Xiong, Yuanchun Zhou
- Abstract summary: We develop a deep Hierarchical Interdisciplinary Research Proposal Classification Network (HIRPCN)
We first propose a hierarchical transformer to extract the textual semantic information of proposals.
We then design an interdisciplinary graph and leverage GNNs for learning representations of each discipline.
- Score: 33.06389455749012
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The peer merit review of research proposals has been the major mechanism for
deciding grant awards. However, research proposals have become increasingly
interdisciplinary. It has been a longstanding challenge to assign
interdisciplinary proposals to appropriate reviewers, so proposals are fairly
evaluated. One of the critical steps in reviewer assignment is to generate
accurate interdisciplinary topic labels for proposal-reviewer matching.
Existing systems mainly collect topic labels manually generated by principal
investigators. However, such human-reported labels can be non-accurate,
incomplete, labor intensive, and time costly. What role can AI play in
developing a fair and precise proposal reviewer assignment system? In this
study, we collaborate with the National Science Foundation of China to address
the task of automated interdisciplinary topic path detection. For this purpose,
we develop a deep Hierarchical Interdisciplinary Research Proposal
Classification Network (HIRPCN). Specifically, we first propose a hierarchical
transformer to extract the textual semantic information of proposals. We then
design an interdisciplinary graph and leverage GNNs for learning
representations of each discipline in order to extract interdisciplinary
knowledge. After extracting the semantic and interdisciplinary knowledge, we
design a level-wise prediction component to fuse the two types of knowledge
representations and detect interdisciplinary topic paths for each proposal. We
conduct extensive experiments and expert evaluations on three real-world
datasets to demonstrate the effectiveness of our proposed model.
Related papers
- Good Idea or Not, Representation of LLM Could Tell [86.36317971482755]
We focus on idea assessment, which aims to leverage the knowledge of large language models to assess the merit of scientific ideas.
We release a benchmark dataset from nearly four thousand manuscript papers with full texts, meticulously designed to train and evaluate the performance of different approaches to this task.
Our findings suggest that the representations of large language models hold more potential in quantifying the value of ideas than their generative outputs.
arXiv Detail & Related papers (2024-09-07T02:07:22Z) - Resolving the Imbalance Issue in Hierarchical Disciplinary Topic
Inference via LLM-based Data Augmentation [5.98277339029019]
This study leverages large language models (Llama V1) as data generators to augment research proposals categorized within intricate disciplinary hierarchies.
Our experiments attest to the efficacy of the generated data, demonstrating that research proposals produced using the prompts can effectively address the aforementioned issues.
arXiv Detail & Related papers (2023-10-09T00:45:20Z) - Interdisciplinary Fairness in Imbalanced Research Proposal Topic Inference: A Hierarchical Transformer-based Method with Selective Interpolation [26.30701957043284]
Automated topic inference can reduce human errors caused by manual topic filling, bridge the knowledge gap between funding agencies and project applicants, and improve system efficiency.
Existing methods overlook the gap in scale between interdisciplinary research proposals and non-interdisciplinary ones, leading to an unjust phenomenon.
In this paper, we implement a topic label inference system based on a Transformer encoder-decoder architecture.
arXiv Detail & Related papers (2023-09-04T16:54:49Z) - Hierarchical MixUp Multi-label Classification with Imbalanced
Interdisciplinary Research Proposals [22.458438099629277]
We propose a hierarchical mixup multiple-label classification framework, which we called H-MixUp.
The number of proposals is imbalanced between non-interdisciplinary and interdisciplinary research.
We develop a fused training method of Wold-level MixUp, Word-level CutMix, Manifold MixUp, and Document-level MixUp to address the third issue.
arXiv Detail & Related papers (2022-09-28T08:27:52Z) - Knowledge-Aware Bayesian Deep Topic Model [50.58975785318575]
We propose a Bayesian generative model for incorporating prior domain knowledge into hierarchical topic modeling.
Our proposed model efficiently integrates the prior knowledge and improves both hierarchical topic discovery and document representation.
arXiv Detail & Related papers (2022-09-20T09:16:05Z) - Who Should Review Your Proposal? Interdisciplinary Topic Path Detection
for Research Proposals [24.995369698179317]
It has been a longstanding challenge to assign proposals to appropriate reviewers.
Existing systems mainly collect topic labels manually reported by discipline investigators.
What role can AI play in developing a fair and precise proposal review system?
arXiv Detail & Related papers (2022-03-07T03:30:50Z) - TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel
Topic Clusters [57.59286394188025]
We propose a novel framework for topic taxonomy completion, named TaxoCom.
TaxoCom discovers novel sub-topic clusters of terms and documents.
Our comprehensive experiments on two real-world datasets demonstrate that TaxoCom not only generates the high-quality topic taxonomy in terms of term coherency and topic coverage.
arXiv Detail & Related papers (2022-01-18T07:07:38Z) - An Explanatory Query-Based Framework for Exploring Academic Expertise [10.887008988767061]
Finding potential collaborators in institutions is a time-consuming manual search task prone to bias.
We propose a novel query-based framework for searching, scoring, and exploring research expertise automatically.
We show that our simple method is effective in identifying matches, while satisfying desirable properties and being efficient.
arXiv Detail & Related papers (2021-05-28T10:48:08Z) - What's New? Summarizing Contributions in Scientific Literature [85.95906677964815]
We introduce a new task of disentangled paper summarization, which seeks to generate separate summaries for the paper contributions and the context of the work.
We extend the S2ORC corpus of academic articles by adding disentangled "contribution" and "context" reference labels.
We propose a comprehensive automatic evaluation protocol which reports the relevance, novelty, and disentanglement of generated outputs.
arXiv Detail & Related papers (2020-11-06T02:23:01Z) - Topic-Aware Multi-turn Dialogue Modeling [91.52820664879432]
This paper presents a novel solution for multi-turn dialogue modeling, which segments and extracts topic-aware utterances in an unsupervised way.
Our topic-aware modeling is implemented by a newly proposed unsupervised topic-aware segmentation algorithm and Topic-Aware Dual-attention Matching (TADAM) Network.
arXiv Detail & Related papers (2020-09-26T08:43:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.