PatentMiner: Patent Vacancy Mining via Context-enhanced and
Knowledge-guided Graph Attention
- URL: http://arxiv.org/abs/2107.04880v1
- Date: Sat, 10 Jul 2021 17:34:57 GMT
- Title: PatentMiner: Patent Vacancy Mining via Context-enhanced and
Knowledge-guided Graph Attention
- Authors: Gaochen Wu, Bin Xu, Yuxin Qin, Fei Kong, Bangchang Liu, Hongwen Zhao,
Dejie Chang
- Abstract summary: We propose a new patent vacancy prediction approach named PatentMiner to mine rich semantic knowledge and predict new potential patents.
Patent knowledge graph over time (e.g. year) is constructed by carrying out named entity recognition and relation extrac-tion from patent documents.
Common Neighbor Method (CNM), Graph Attention Networks (GAT) and Context-enhanced Graph Attention Networks (CGAT) are proposed to perform link prediction in the constructed knowledge graph.
- Score: 2.9290732102216452
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Although there are a small number of work to conduct patent research by
building knowledge graph, but without constructing patent knowledge graph using
patent documents and combining latest natural language processing methods to
mine hidden rich semantic relationships in existing patents and predict new
possible patents. In this paper, we propose a new patent vacancy prediction
approach named PatentMiner to mine rich semantic knowledge and predict new
potential patents based on knowledge graph (KG) and graph attention mechanism.
Firstly, patent knowledge graph over time (e.g. year) is constructed by
carrying out named entity recognition and relation extrac-tion from patent
documents. Secondly, Common Neighbor Method (CNM), Graph Attention Networks
(GAT) and Context-enhanced Graph Attention Networks (CGAT) are proposed to
perform link prediction in the constructed knowledge graph to dig out the
potential triples. Finally, patents are defined on the knowledge graph by means
of co-occurrence relationship, that is, each patent is represented as a fully
connected subgraph containing all its entities and co-occurrence relationships
of the patent in the knowledge graph; Furthermore, we propose a new patent
prediction task which predicts a fully connected subgraph with newly added
prediction links as a new pa-tent. The experimental results demonstrate that
our proposed patent predic-tion approach can correctly predict new patents and
Context-enhanced Graph Attention Networks is much better than the baseline.
Meanwhile, our proposed patent vacancy prediction task still has significant
room to im-prove.
Related papers
- PatentEdits: Framing Patent Novelty as Textual Entailment [62.8514393375952]
We introduce the PatentEdits dataset, which contains 105K examples of successful revisions.
We design algorithms to label edits sentence by sentence, then establish how well these edits can be predicted with large language models.
We demonstrate that evaluating textual entailment between cited references and draft sentences is especially effective in predicting which inventive claims remained unchanged or are novel in relation to prior art.
arXiv Detail & Related papers (2024-11-20T17:23:40Z) - Connecting the Dots: Inferring Patent Phrase Similarity with Retrieved Phrase Graphs [18.86788223751979]
We study the patent phrase similarity inference task, which measures the semantic similarity between two patent phrases.
We introduce a graph-augmented approach to amplify the global contextual information of the patent phrases.
arXiv Detail & Related papers (2024-03-24T18:59:38Z) - PaECTER: Patent-level Representation Learning using Citation-informed
Transformers [0.16785092703248325]
PaECTER is a publicly available, open-source document-level encoder specific for patents.
We fine-tune BERT for Patents with examiner-added citation information to generate numerical representations for patent documents.
PaECTER performs better in similarity tasks than current state-of-the-art models used in the patent domain.
arXiv Detail & Related papers (2024-02-29T18:09:03Z) - Unveiling Black-boxes: Explainable Deep Learning Models for Patent
Classification [48.5140223214582]
State-of-the-art methods for multi-label patent classification rely on deep opaque neural networks (DNNs)
We propose a novel deep explainable patent classification framework by introducing layer-wise relevance propagation (LRP)
Considering the relevance score, we then generate explanations by visualizing relevant words for the predicted patent class.
arXiv Detail & Related papers (2023-10-31T14:11:37Z) - Graph Representation Learning Towards Patents Network Analysis [2.202803272456695]
This research employed a graph representation learning approach to create, analyze, and find similarities in the patent data registered in the Iranian Official Gazette.
Key entities were extracted from the scrapped patents dataset to create the Iranian patents graph from scratch.
Thanks to the utilization of novel graph algorithms and text mining methods, we identified new areas of industry and research from Iranian patent data.
arXiv Detail & Related papers (2023-09-25T05:49:40Z) - Event-based Dynamic Graph Representation Learning for Patent Application
Trend Prediction [45.0907126466271]
We propose an event-based graph learning framework for patent application trend prediction.
In particular, our method is founded on the memorable representations of both companies and patent classification codes.
arXiv Detail & Related papers (2023-08-04T05:43:32Z) - Citation Trajectory Prediction via Publication Influence Representation
Using Temporal Knowledge Graph [52.07771598974385]
Existing approaches mainly rely on mining temporal and graph data from academic articles.
Our framework is composed of three modules: difference-preserved graph embedding, fine-grained influence representation, and learning-based trajectory calculation.
Experiments are conducted on both the APS academic dataset and our contributed AIPatent dataset.
arXiv Detail & Related papers (2022-10-02T07:43:26Z) - The Harvard USPTO Patent Dataset: A Large-Scale, Well-Structured, and
Multi-Purpose Corpus of Patent Applications [8.110699646062384]
We introduce the Harvard USPTO Patent dataset (HUPD)
With more than 4.5 million patent documents, HUPD is two to three times larger than comparable corpora.
By providing each application's metadata along with all of its text fields, the dataset enables researchers to perform new sets of NLP tasks.
arXiv Detail & Related papers (2022-07-08T17:57:15Z) - A Survey on Sentence Embedding Models Performance for Patent Analysis [0.0]
We propose a standard library and dataset for assessing the accuracy of embeddings models based on PatentSBERTa approach.
Results show PatentSBERTa, Bert-for-patents, and TF-IDF Weighted Word Embeddings have the best accuracy for computing sentence embeddings at the subclass level.
arXiv Detail & Related papers (2022-04-28T12:04:42Z) - Dynamic Semantic Graph Construction and Reasoning for Explainable
Multi-hop Science Question Answering [50.546622625151926]
We propose a new framework to exploit more valid facts while obtaining explainability for multi-hop QA.
Our framework contains three new ideas: (a) tt AMR-SG, an AMR-based Semantic Graph, constructed by candidate fact AMRs to uncover any hop relations among question, answer and multiple facts, (b) a novel path-based fact analytics approach exploiting tt AMR-SG to extract active facts from a large fact pool to answer questions, and (c) a fact-level relation modeling leveraging graph convolution network (GCN) to guide the reasoning process.
arXiv Detail & Related papers (2021-05-25T09:14:55Z) - Graph Learning based Recommender Systems: A Review [111.43249652335555]
Graph Learning based Recommender Systems (GLRS) employ advanced graph learning approaches to model users' preferences and intentions as well as items' characteristics for recommendations.
We provide a systematic review of GLRS, by discussing how they extract important knowledge from graph-based representations to improve the accuracy, reliability and explainability of the recommendations.
arXiv Detail & Related papers (2021-05-13T14:50:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.