Scenic: A Language for Scenario Specification and Data Generation
- URL: http://arxiv.org/abs/2010.06580v1
- Date: Tue, 13 Oct 2020 17:58:31 GMT
- Title: Scenic: A Language for Scenario Specification and Data Generation
- Authors: Daniel J. Fremont and Edward Kim and Tommaso Dreossi and Shromona
Ghosh and Xiangyu Yue and Alberto L. Sangiovanni-Vincentelli and Sanjit A.
Seshia
- Abstract summary: We propose a new probabilistic programming language for the design and analysis of cyber-physical systems.
In this paper, we focus on systems like autonomous cars and robots, whose environment at any point in time is a'scene'
We design a domain-specific language, Scenic, for describing scenarios that are distributions over scenes and the behaviors of their agents over time.
- Score: 17.07493567658614
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a new probabilistic programming language for the design and
analysis of cyber-physical systems, especially those based on machine learning.
Specifically, we consider the problems of training a system to be robust to
rare events, testing its performance under different conditions, and debugging
failures. We show how a probabilistic programming language can help address
these problems by specifying distributions encoding interesting types of
inputs, then sampling these to generate specialized training and test data.
More generally, such languages can be used to write environment models, an
essential prerequisite to any formal analysis. In this paper, we focus on
systems like autonomous cars and robots, whose environment at any point in time
is a 'scene', a configuration of physical objects and agents. We design a
domain-specific language, Scenic, for describing scenarios that are
distributions over scenes and the behaviors of their agents over time. As a
probabilistic programming language, Scenic allows assigning distributions to
features of the scene, as well as declaratively imposing hard and soft
constraints over the scene. We develop specialized techniques for sampling from
the resulting distribution, taking advantage of the structure provided by
Scenic's domain-specific syntax. Finally, we apply Scenic in a case study on a
convolutional neural network designed to detect cars in road images, improving
its performance beyond that achieved by state-of-the-art synthetic data
generation methods.
Related papers
- Text2Scenario: Text-Driven Scenario Generation for Autonomous Driving Test [15.601818101020996]
Text2Scenario is a framework that autonomously generates simulation test scenarios that closely align with user specifications.
Result is an efficient and precise evaluation of diverse AD stacks void of the labor-intensive need for manual scenario configuration.
arXiv Detail & Related papers (2025-03-04T07:20:25Z) - Digestion Algorithm in Hierarchical Symbolic Forests: A Fast Text Normalization Algorithm and Semantic Parsing Framework for Specific Scenarios and Lightweight Deployment [0.0]
Text Normalization and Semantic Parsing have numerous applications in natural language processing, such as natural language programming, paraphrasing, data augmentation, constructing expert systems, text matching, and more.
Despite the prominent achievements of deep learning in Large Language Models (LLMs), the interpretability of neural network architectures is still poor, which affects their credibility and hence limits the deployments of risk-sensitive scenarios.
In certain scenario-specific domains with scarce data, rapidly obtaining a large number of supervised learning labels is challenging, and the workload of manually labeling data would be enormous.
DAHSF is proposed to address these issues, combining text
arXiv Detail & Related papers (2024-12-18T17:05:49Z) - ScenicNL: Generating Probabilistic Scenario Programs from Natural Language [22.314264838832287]
We present ScenarioNL, an AI System for creating scenario programs from natural language.
We generate these programs from police crash reports.
We evaluate our system on publicly available autonomous vehicle crash reports in California from the last five years.
arXiv Detail & Related papers (2024-05-03T23:06:31Z) - LaMPP: Language Models as Probabilistic Priors for Perception and Action [38.07277869107474]
We show how to leverage language models for non-linguistic perception and control tasks.
Our approach casts labeling and decision-making as inference in probabilistic graphical models.
arXiv Detail & Related papers (2023-02-03T15:14:04Z) - Leveraging Natural Supervision for Language Representation Learning and
Generation [8.083109555490475]
We describe three lines of work that seek to improve the training and evaluation of neural models using naturally-occurring supervision.
We first investigate self-supervised training losses to help enhance the performance of pretrained language models for various NLP tasks.
We propose a framework that uses paraphrase pairs to disentangle semantics and syntax in sentence representations.
arXiv Detail & Related papers (2022-07-21T17:26:03Z) - Pre-Trained Language Models for Interactive Decision-Making [72.77825666035203]
We describe a framework for imitation learning in which goals and observations are represented as a sequence of embeddings.
We demonstrate that this framework enables effective generalization across different environments.
For test tasks involving novel goals or novel scenes, initializing policies with language models improves task completion rates by 43.6%.
arXiv Detail & Related papers (2022-02-03T18:55:52Z) - Addressing the IEEE AV Test Challenge with Scenic and VerifAI [10.221093591444731]
This paper summarizes our formal approach to testing autonomous vehicles (AVs) in simulation for the IEEE AV Test Challenge.
We demonstrate a systematic testing framework leveraging our previous work on formally-driven simulation for intelligent cyber-physical systems.
arXiv Detail & Related papers (2021-08-20T04:51:27Z) - Towards Zero-shot Language Modeling [90.80124496312274]
We construct a neural model that is inductively biased towards learning human languages.
We infer this distribution from a sample of typologically diverse training languages.
We harness additional language-specific side information as distant supervision for held-out languages.
arXiv Detail & Related papers (2021-08-06T23:49:18Z) - FDMT: A Benchmark Dataset for Fine-grained Domain Adaptation in Machine
Translation [53.87731008029645]
We present a real-world fine-grained domain adaptation task in machine translation (FDMT)
The FDMT dataset consists of four sub-domains of information technology: autonomous vehicles, AI education, real-time networks and smart phone.
We make quantitative experiments and deep analyses in this new setting, which benchmarks the fine-grained domain adaptation task.
arXiv Detail & Related papers (2020-12-31T17:15:09Z) - Rearrangement: A Challenge for Embodied AI [229.8891614821016]
We describe a framework for research and evaluation in Embodied AI.
Our proposal is based on a canonical task: Rearrangement.
We present experimental testbeds of rearrangement scenarios in four different simulation environments.
arXiv Detail & Related papers (2020-11-03T19:42:32Z) - Linguistic Typology Features from Text: Inferring the Sparse Features of
World Atlas of Language Structures [73.06435180872293]
We construct a recurrent neural network predictor based on byte embeddings and convolutional layers.
We show that some features from various linguistic types can be predicted reliably.
arXiv Detail & Related papers (2020-04-30T21:00:53Z) - Synthetic Datasets for Neural Program Synthesis [66.20924952964117]
We propose a new methodology for controlling and evaluating the bias of synthetic data distributions over both programs and specifications.
We demonstrate, using the Karel DSL and a small Calculator DSL, that training deep networks on these distributions leads to improved cross-distribution generalization performance.
arXiv Detail & Related papers (2019-12-27T21:28:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.