WikiCoder: Learning to Write Knowledge-Powered Code
- URL: http://arxiv.org/abs/2303.08574v1
- Date: Wed, 15 Mar 2023 12:50:54 GMT
- Title: WikiCoder: Learning to Write Knowledge-Powered Code
- Authors: Th\'eo Matricon, Nathana\"el Fijalkow, Ga\"etan Margueritte
- Abstract summary: This paper makes a first step towards knowledge-powered program synthesis.
We present WikiCoder, a system building upon state of the art machine-learned program synthesizers and integrating knowledge graphs.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We tackle the problem of automatic generation of computer programs from a few
pairs of input-output examples. The starting point of this work is the
observation that in many applications a solution program must use external
knowledge not present in the examples: we call such programs knowledge-powered
since they can refer to information collected from a knowledge graph such as
Wikipedia. This paper makes a first step towards knowledge-powered program
synthesis. We present WikiCoder, a system building upon state of the art
machine-learned program synthesizers and integrating knowledge graphs. We
evaluate it to show its wide applicability over different domains and discuss
its limitations. WikiCoder solves tasks that no program synthesizers were able
to solve before thanks to the use of knowledge graphs, while integrating with
recent developments in the field to operate at scale.
Related papers
- A Symbolic Computing Perspective on Software Systems [0.0]
Symbolic mathematical computing systems have served as a canary in the coal mine of software systems for more than sixty years.
All of the major symbolic mathematical computing systems include low-level code for arithmetic, memory management and other primitives, a compiler or interpreter for a bespoke programming language, a library of high level mathematical algorithms, and some form of user interface.
arXiv Detail & Related papers (2024-06-13T13:10:47Z) - Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit [63.82016263181941]
Code intelligence leverages machine learning techniques to extract knowledge from extensive code corpora.
Currently, there is already a thriving research community focusing on code intelligence.
arXiv Detail & Related papers (2023-12-30T17:48:37Z) - Beyond Factuality: A Comprehensive Evaluation of Large Language Models
as Knowledge Generators [78.63553017938911]
Large language models (LLMs) outperform information retrieval techniques for downstream knowledge-intensive tasks.
However, community concerns abound regarding the factuality and potential implications of using this uncensored knowledge.
We introduce CONNER, designed to evaluate generated knowledge from six important perspectives.
arXiv Detail & Related papers (2023-10-11T08:22:37Z) - Collaborative, Code-Proximal Dynamic Software Visualization within Code
Editors [55.57032418885258]
This paper introduces the design and proof-of-concept implementation for a software visualization approach that can be embedded into code editors.
Our contribution differs from related work in that we use dynamic analysis of a software system's runtime behavior.
Our visualization approach enhances common remote pair programming tools and is collaboratively usable by employing shared code cities.
arXiv Detail & Related papers (2023-08-30T06:35:40Z) - A Roadmap to Domain Knowledge Integration in Machine Learning [21.96548398967003]
Integrating knowledge in a machine learning model can help to overcome these obstacles up to a certain degree.
We will give a brief overview of these different forms of knowledge integration and their performance in certain machine learning tasks.
arXiv Detail & Related papers (2022-12-12T05:35:44Z) - Hardware-agnostic Computation for Large-scale Knowledge Graph Embeddings [1.1650381752104297]
We develop a framework to compute embeddings for large-scale knowledge graphs in a hardware-agnostic manner.
We provide an open-source version of our framework along with a hub of pre-trained models having more than 11.4 B parameters.
arXiv Detail & Related papers (2022-07-18T12:10:27Z) - Leveraging Language to Learn Program Abstractions and Search Heuristics [66.28391181268645]
We introduce LAPS (Language for Abstraction and Program Search), a technique for using natural language annotations to guide joint learning of libraries and neurally-guided search models for synthesis.
When integrated into a state-of-the-art library learning system (DreamCoder), LAPS produces higher-quality libraries and improves search efficiency and generalization.
arXiv Detail & Related papers (2021-06-18T15:08:47Z) - PyKale: Knowledge-Aware Machine Learning from Multiple Sources in Python [6.276936701568444]
Pykale is a Python library for knowledge-aware machine learning on graphs, images, texts, and videos.
We formulate new green machine learning guidelines based on standard software engineering practices.
We build PyKale on PyTorch and leverage the rich PyTorch ecosystem.
arXiv Detail & Related papers (2021-06-17T18:35:37Z) - Improving Compositionality of Neural Networks by Decoding
Representations to Inputs [83.97012077202882]
We bridge the benefits of traditional and deep learning programs by jointly training a generative model to constrain neural network activations to "decode" back to inputs.
We demonstrate applications of decodable representations to out-of-distribution detection, adversarial examples, calibration, and fairness.
arXiv Detail & Related papers (2021-06-01T20:07:16Z) - Machine Learning for Software Engineering: A Systematic Mapping [73.30245214374027]
The software development industry is rapidly adopting machine learning for transitioning modern day software systems towards highly intelligent and self-learning systems.
No comprehensive study exists that explores the current state-of-the-art on the adoption of machine learning across software engineering life cycle stages.
This study introduces a machine learning for software engineering (MLSE) taxonomy classifying the state-of-the-art machine learning techniques according to their applicability to various software engineering life cycle stages.
arXiv Detail & Related papers (2020-05-27T11:56:56Z) - Turning 30: New Ideas in Inductive Logic Programming [18.581514902689346]
inductive logic programming is a form of machine learning that induces logic programs from data.
We focus on new methods for learning programs that generalise from few examples.
We also discuss directions for future research in inductive logic programming.
arXiv Detail & Related papers (2020-02-25T16:23:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.