Environmental Claim Detection
- URL: http://arxiv.org/abs/2209.00507v4
- Date: Fri, 26 May 2023 07:25:47 GMT
- Title: Environmental Claim Detection
- Authors: Dominik Stammbach, Nicolas Webersinke, Julia Anna Bingler, Mathias Kraus, Markus Leippold
- Abstract summary: This paper introduces the task of environmental claim detection.
We release an expert-annotated dataset and models trained on this dataset.
We find that the number of environmental claims has steadily increased since the Paris Agreement in 2015.
- Score: 6.2887102994549595
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: To transition to a green economy, environmental claims made by companies must
be reliable, comparable, and verifiable. To analyze such claims at scale,
automated methods are needed to detect them in the first place. However, there
exist no datasets or models for this. Thus, this paper introduces the task of
environmental claim detection. To accompany the task, we release an
expert-annotated dataset and models trained on this dataset. We preview one
potential application of such models: We detect environmental claims made in
quarterly earnings calls and find that the number of environmental claims has
steadily increased since the Paris Agreement in 2015.
Related papers
- Contrastive Learning to Improve Retrieval for Real-world Fact Checking [84.57583869042791]
We present Contrastive Fact-Checking Reranker (CFR), an improved retriever for fact-checking complex claims.
We leverage the AVeriTeC dataset, which annotates subquestions for claims with human-written answers from evidence documents.
We find a 6% improvement in veracity classification accuracy on the dataset.
arXiv Detail & Related papers (2024-10-07T00:09:50Z)
- Estimating Environmental Cost Throughout Model's Adaptive Life Cycle [2.93774265594295]
PreIndex is a predictive index for estimating the environmental and compute resources associated with retraining a model to adapt to distributional shifts in data.
It can be used to estimate environmental costs such as carbon emissions and energy usage when retraining from the current data distribution to a new one.
arXiv Detail & Related papers (2024-07-23T03:58:06Z)
- EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection [0.0]
EcoVerse is an annotated English Twitter dataset of 3,023 tweets spanning a wide spectrum of environmental topics.
We propose a three-level annotation scheme covering Eco-Relevance Classification and Stance Detection, and introduce an original approach for Environmental Impact Analysis.
arXiv Detail & Related papers (2024-04-08T01:21:11Z)
- ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments [1.4070907500169874]
This paper introduces a new semantic segmentation dataset specifically tailored for construction sites.
The dataset is designed to enhance the training and evaluation of object detection models.
arXiv Detail & Related papers (2023-12-27T10:49:19Z)
- Leveraging Language Models to Detect Greenwashing [39.58317527488534]
We introduce a novel preliminary methodology to train a language model on generated labels for greenwashing risk.
Our best model achieved an average accuracy score of 86.34% and F1 score of 0.67, demonstrating that our proof-of-concept methodology shows a promising direction of exploration.
arXiv Detail & Related papers (2023-10-30T21:41:49Z)
- A Comparative Study of Machine Learning Algorithms for Anomaly Detection in Industrial Environments: Performance and Environmental Impact [62.997667081978825]
This study seeks to balance the demands of high-performance machine learning models with environmental sustainability.
Traditional machine learning algorithms, such as Decision Trees and Random Forests, demonstrate robust efficiency and performance.
However, superior outcomes were obtained with optimised configurations, albeit with a commensurate increase in resource consumption.
arXiv Detail & Related papers (2023-07-01T15:18:00Z)
- WiCE: Real-World Entailment for Claims in Wikipedia [63.234352061821625]
We propose WiCE, a new fine-grained textual entailment dataset built on natural claim and evidence pairs extracted from Wikipedia.
In addition to standard claim-level entailment, WiCE provides entailment judgments over sub-sentence units of the claim.
We show that real claims in our dataset involve challenging verification and retrieval problems that existing models fail to address.
arXiv Detail & Related papers (2023-03-02T17:45:32Z)
- Counting Carbon: A Survey of Factors Influencing the Emissions of Machine Learning [77.62876532784759]
Machine learning (ML) requires using energy to carry out computations during the model training process.
The generation of this energy comes with an environmental cost in terms of greenhouse gas emissions, depending on quantity used and the energy source.
We present a survey of the carbon emissions of 95 ML models across time and different tasks in natural language processing and computer vision.
arXiv Detail & Related papers (2023-02-16T18:35:00Z)
- Greenhouse gases emissions: estimating corporate non-reported emissions using interpretable machine learning [0.0]
As of 2022, greenhouse gas (GHG) emissions reporting and auditing are not yet compulsory for all companies.
We propose a machine learning-based model to estimate scope 1 and scope 2 GHG emissions of companies that do not yet report them.
arXiv Detail & Related papers (2022-12-21T08:36:02Z)
- AmbiFC: Fact-Checking Ambiguous Claims with Evidence [57.7091560922174]
We present AmbiFC, a fact-checking dataset with 10k claims derived from real-world information needs.
We analyze disagreements arising from ambiguity when comparing claims against evidence in AmbiFC.
We develop models that predict veracity while handling this ambiguity via soft labels.
arXiv Detail & Related papers (2021-04-01T17:40:08Z)
- Analyzing Sustainability Reports Using Natural Language Processing [68.8204255655161]
In recent years, companies have increasingly been aiming to both mitigate their environmental impact and adapt to the changing climate context.
These efforts are documented in increasingly exhaustive reports, which cover many types of climate risks and exposures under the umbrella of Environmental, Social, and Governance (ESG).
We present this tool and the methodology that we used to develop it in the present article.
arXiv Detail & Related papers (2020-11-03T21:22:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.