GenericsKB: A Knowledge Base of Generic Statements
- URL: http://arxiv.org/abs/2005.00660v1
- Date: Sat, 2 May 2020 00:08:42 GMT
- Title: GenericsKB: A Knowledge Base of Generic Statements
- Authors: Sumithra Bhakthavatsalam, Chloe Anastasiades, Peter Clark
- Abstract summary: We present a new resource for the NLP community, namely a large (3.5M+ sentence) knowledge base of *generic statements*
This is the first large resource to contain *naturally occurring* generic sentences, as opposed to extracted or crowdsourced triples.
All GenericsKB sentences are annotated with their topical term, surrounding context (sentences), and a (learned) confidence.
- Score: 18.68800894936855
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a new resource for the NLP community, namely a large (3.5M+
sentence) knowledge base of *generic statements*, e.g., "Trees remove carbon
dioxide from the atmosphere", collected from multiple corpora. This is the
first large resource to contain *naturally occurring* generic sentences, as
opposed to extracted or crowdsourced triples, and thus is rich in high-quality,
general, semantically complete statements. All GenericsKB sentences are
annotated with their topical term, surrounding context (sentences), and a
(learned) confidence. We also release GenericsKB-Best (1M+ sentences),
containing the best-quality generics in GenericsKB augmented with selected,
synthesized generics from WordNet and ConceptNet. In tests on two existing
datasets requiring multihop reasoning (OBQA and QASC), we find using GenericsKB
can result in higher scores and better explanations than using a much larger
corpus. This demonstrates that GenericsKB can be a useful resource for NLP
applications, as well as providing data for linguistic studies of generics and
their semantics. GenericsKB is available at
https://allenai.org/data/genericskb.
Related papers
- BigText-QA: Question Answering over a Large-Scale Hybrid Knowledge Graph [23.739432128095107]
BigText-QA is able to answer questions based on a structured knowledge graph.
Our results demonstrate that BigText-QA outperforms DrQA, a neural-network-based QA system, and achieves competitive results to QUEST, a graph-based unsupervised QA system.
arXiv Detail & Related papers (2022-12-12T09:49:02Z) - Uni-Parser: Unified Semantic Parser for Question Answering on Knowledge
Base and Database [86.03294330305097]
We propose a unified semantic element for question answering (QA) on both knowledge bases (KB) and databases (DB)
We introduce the primitive (relation and entity in KB, table name, column name and cell value in DB) as an essential element in our framework.
We leverage the generator to predict final logical forms by altering and composing topranked primitives with different operations.
arXiv Detail & Related papers (2022-11-09T19:33:27Z) - Penguins Don't Fly: Reasoning about Generics through Instantiations and
Exceptions [73.56753518339247]
We present a novel framework informed by linguistic theory to generate exemplars -- specific cases when a generic holds true or false.
We generate 19k exemplars for 650 generics and show that our framework outperforms a strong GPT-3 baseline by 12.8 precision points.
arXiv Detail & Related papers (2022-05-23T22:45:53Z) - Incorporating Constituent Syntax for Coreference Resolution [50.71868417008133]
We propose a graph-based method to incorporate constituent syntactic structures.
We also explore to utilise higher-order neighbourhood information to encode rich structures in constituent trees.
Experiments on the English and Chinese portions of OntoNotes 5.0 benchmark show that our proposed model either beats a strong baseline or achieves new state-of-the-art performance.
arXiv Detail & Related papers (2022-02-22T07:40:42Z) - Automatic Construction of Sememe Knowledge Bases via Dictionaries [53.8700954466358]
Sememe knowledge bases (SKBs) enable sememes to be applied to natural language processing.
Most languages have no SKBs, and manual construction of SKBs is time-consuming and labor-intensive.
We propose a simple and fully automatic method of building an SKB via an existing dictionary.
arXiv Detail & Related papers (2021-05-26T14:41:01Z) - Quinductor: a multilingual data-driven method for generating
reading-comprehension questions using Universal Dependencies [0.0]
We propose a multilingual data-driven method for generating reading comprehension questions using dependency trees.
Our method provides a strong, mostly deterministic, and inexpensive-to-train baseline for less-resourced languages.
arXiv Detail & Related papers (2021-03-18T09:49:56Z) - Reasoning Over Virtual Knowledge Bases With Open Predicate Relations [85.19305347984515]
We present the Open Predicate Query Language (OPQL)
OPQL is a method for constructing a virtual Knowledge Base (VKB) trained entirely from text.
We demonstrate that OPQL outperforms prior VKB methods on two different KB reasoning tasks.
arXiv Detail & Related papers (2021-02-14T01:29:54Z) - Beyond I.I.D.: Three Levels of Generalization for Question Answering on
Knowledge Bases [63.43418760818188]
We release a new large-scale, high-quality dataset with 64,331 questions, GrailQA.
We propose a novel BERT-based KBQA model.
The combination of our dataset and model enables us to thoroughly examine and demonstrate, for the first time, the key role of pre-trained contextual embeddings like BERT in the generalization of KBQA.
arXiv Detail & Related papers (2020-11-16T06:36:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.