Greenhouse gases emissions: estimating corporate non-reported emissions
using interpretable machine learning
- URL: http://arxiv.org/abs/2212.10844v1
- Date: Wed, 21 Dec 2022 08:36:02 GMT
- Title: Greenhouse gases emissions: estimating corporate non-reported emissions
using interpretable machine learning
- Authors: Jeremi Assael (BNPP CIB GM Lab, MICS), Thibaut Heurtebize, Laurent
Carlier (BNPP CIB GM Lab), Fran\c{c}ois Soup\'e
- Abstract summary: As of 2022, greenhouse gases (GHG) emissions reporting and auditing are not yet compulsory for all companies.
We propose a machine learning-based model to estimate scope 1 and scope 2 GHG emissions of companies not reporting them yet.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As of 2022, greenhouse gases (GHG) emissions reporting and auditing are not
yet compulsory for all companies and methodologies of measurement and
estimation are not unified. We propose a machine learning-based model to
estimate scope 1 and scope 2 GHG emissions of companies not reporting them yet.
Our model, specifically designed to be transparent and completely adapted to
this use case, is able to estimate emissions for a large universe of companies.
It shows good out-of-sample global performances as well as good out-of-sample
granular performances when evaluating it by sectors, by countries or by
revenues buckets. We also compare our results to those of other providers and
find our estimates to be more accurate. Thanks to the proposed explainability
tools using Shapley values, our model is fully interpretable, the user being
able to understand which factors split explain the GHG emissions for each
particular company.
Related papers
- Climate AI for Corporate Decarbonization Metrics Extraction [7.522638089716454]
We introduce the Climate Artificial Intelligence for Corporate Decarbonization Metrics Extraction (CAI) model and pipeline.
We demonstrate that the process improves data collection efficiency and accuracy by automating data curation, validation, and metric scoring from public corporate disclosures.
arXiv Detail & Related papers (2024-11-05T18:37:51Z) - Reporting and Analysing the Environmental Impact of Language Models on the Example of Commonsense Question Answering with External Knowledge [7.419725234099729]
ChatGPT sparked social interest in Large Language Models (LLMs)
LLMs demand substantial computational resources and are very costly to train, both financially and environmentally.
In this study, we infused T5 LLM with external knowledge and fine-tuned the model for Question-Answering task.
arXiv Detail & Related papers (2024-07-24T16:16:16Z) - Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions [75.45274978665684]
Vision-Language Understanding (VLU) benchmarks contain samples where answers rely on assumptions unsupported by the provided context.
We collect contextual data for each sample whenever available and train a context selection module to facilitate evidence-based model predictions.
We develop a general-purpose Context-AwaRe Abstention detector to identify samples lacking sufficient context and enhance model accuracy.
arXiv Detail & Related papers (2024-05-18T02:21:32Z) - A Comparative Study of Machine Learning Algorithms for Anomaly Detection
in Industrial Environments: Performance and Environmental Impact [62.997667081978825]
This study seeks to address the demands of high-performance machine learning models with environmental sustainability.
Traditional machine learning algorithms, such as Decision Trees and Random Forests, demonstrate robust efficiency and performance.
However, superior outcomes were obtained with optimised configurations, albeit with a commensurate increase in resource consumption.
arXiv Detail & Related papers (2023-07-01T15:18:00Z) - Gaussian Process Probes (GPP) for Uncertainty-Aware Probing [61.91898698128994]
We introduce a unified and simple framework for probing and measuring uncertainty about concepts represented by models.
Our experiments show it can (1) probe a model's representations of concepts even with a very small number of examples, (2) accurately measure both epistemic uncertainty (how confident the probe is) and aleatory uncertainty (how fuzzy the concepts are to the model), and (3) detect out of distribution data using those uncertainty measures as well as classic methods do.
arXiv Detail & Related papers (2023-05-29T17:00:16Z) - ReCOGS: How Incidental Details of a Logical Form Overshadow an
Evaluation of Semantic Interpretation [63.33465936588327]
We propose a modified version of the compositional generalization benchmark COGS.
Our results reaffirm the importance of compositional generalization and careful benchmark task design.
arXiv Detail & Related papers (2023-03-24T00:01:24Z) - Counting Carbon: A Survey of Factors Influencing the Emissions of
Machine Learning [77.62876532784759]
Machine learning (ML) requires using energy to carry out computations during the model training process.
The generation of this energy comes with an environmental cost in terms of greenhouse gas emissions, depending on quantity used and the energy source.
We present a survey of the carbon emissions of 95 ML models across time and different tasks in natural language processing and computer vision.
arXiv Detail & Related papers (2023-02-16T18:35:00Z) - Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language
Model [72.65502770895417]
We quantify the carbon footprint of BLOOM, a 176-billion parameter language model, across its life cycle.
We estimate that BLOOM's final training emitted approximately 24.7 tonnes ofcarboneqif we consider only the dynamic power consumption.
We conclude with a discussion regarding the difficulty of precisely estimating the carbon footprint of machine learning models.
arXiv Detail & Related papers (2022-11-03T17:13:48Z) - Environmental Claim Detection [6.2887102994549595]
This paper introduces the task of environmental claim detection.
We release an expert-annotated dataset and models trained on this dataset.
We find that the number of environmental claims has steadily increased since the Paris Agreement in 2015.
arXiv Detail & Related papers (2022-09-01T14:51:07Z) - Automated Estimation of Construction Equipment Emission using Inertial
Sensors and Machine Learning Models [0.0]
Construction industry is one of the main producers of greenhouse gasses (GHG)
This paper describes the development and deployment of a novel framework that uses machine learning (ML) methods to predict the level of emissions from heavy construction equipment monitored via the Internet of Things (IoT)
Different ML algorithms were developed and compared to identify the best model to predict emission levels from inertial sensors data.
arXiv Detail & Related papers (2021-09-27T22:37:55Z) - Estimation of Corporate Greenhouse Gas Emissions via Machine Learning [1.3190581566723916]
The European Commission adopted the most ambitious package of climate impact measures in April 2021.
The ability to see the carbon footprint of companies around the world will be critical for investors to comply with the measures.
By training a machine learning model on disclosed GHG emissions, we are able to estimate the emissions of other companies globally who do not disclose their emissions.
arXiv Detail & Related papers (2021-09-09T14:50:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.