GreenDB: Toward a Product-by-Product Sustainability Database
- URL: http://arxiv.org/abs/2205.02908v1
- Date: Thu, 5 May 2022 20:24:16 GMT
- Title: GreenDB: Toward a Product-by-Product Sustainability Database
- Authors: Sebastian Jäger, Jessica Greene, Max Jakob, Ruben Korenke, Tilman Santarius, Felix Biessmann
- Abstract summary: Modern retail platforms rely heavily on Machine Learning (ML) for their search and recommender systems.
No open and publicly available database integrates sustainability information on a product-by-product basis.
We present our proof of concept implementation of a scraping system that creates the GreenDB dataset.
- Score: 2.9971739294416717
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The production, shipping, usage, and disposal of consumer goods have a
substantial impact on greenhouse gas emissions and the depletion of resources.
Modern retail platforms rely heavily on Machine Learning (ML) for their search
and recommender systems. Thus, ML can potentially support efforts towards more
sustainable consumption patterns, for example, by accounting for sustainability
aspects in product search or recommendations. However, leveraging ML potential
for reaching sustainability goals requires data on sustainability.
Unfortunately, no open and publicly available database integrates
sustainability information on a product-by-product basis. In this work, we
present the GreenDB, which fills this gap. Based on search logs of millions of
users, we prioritize which products users care about most. The GreenDB schema
extends the well-known schema.org Product definition and can be readily
integrated into existing product catalogs to improve sustainability information
available for search and recommendation experiences. We present our proof of
concept implementation of a scraping system that creates the GreenDB dataset.
 
      
Related papers
- Insights Informed Generative AI for Design: Incorporating Real-world Data for Text-to-Image Output [51.88841610098437]
 We propose a novel pipeline that integrates DALL-E 3 with a materials dataset to enrich AI-generated designs with sustainability metrics and material usage insights.
We evaluate the system through three user tests: (1) no mention of sustainability to the user prior to the prompting process with generative AI, (2) sustainability goals communicated to the user before prompting, and (3) sustainability goals communicated along with quantitative CO2e data included in the generative AI outputs.
 arXiv  Detail & Related papers  (2025-06-17T22:33:11Z)
- Towards a Knowledge Base of Common Sustainability Weaknesses in Green Software Development [9.521952718902973]
 In this paper, we motivate the need for the development of a standard knowledge base of commonly occurring sustainability weaknesses in code.
We demonstrate why existing knowledge regarding software weaknesses cannot be re-tagged "as is" to sustainability without significant due diligence.
 arXiv  Detail & Related papers  (2025-06-10T14:03:58Z)
- LLM-TabFlow: Synthetic Tabular Data Generation with Inter-column Logical Relationship Preservation [49.898152180805454]
 This study is the first to explicitly address inter-column relationship preservation in synthetic tabular data generation.
LLM-TabFlow is a novel approach that captures complex inter-column relationships and compresses data, while using Score-based Diffusion to model the distribution of the compressed data in latent space.
Our results show that LLM-TabFlow outperforms all baselines, fully preserving inter-column relationships while achieving the best balance between data fidelity, utility, and privacy.
 arXiv  Detail & Related papers  (2025-03-04T00:47:52Z)
- Building Knowledge Graphs Towards a Global Food Systems Datahub [0.9752919973942652]
 There is a lack of studies that comprehensively examine sustainable agricultural practices across various products and production methods.
We are building a set of KN and Knowledge Graphs (KGs) that encode knowledge associated with sustainable wheat production.
 arXiv  Detail & Related papers  (2025-02-26T19:13:11Z)
- Entity Linking using LLMs for Automated Product Carbon Footprint Estimation [4.423169535332588]
 Growing concerns about climate change and sustainability are driving manufacturers to take significant steps toward reducing their carbon footprints.
For these manufacturers, a first step towards this goal is to identify the environmental impact of the individual components of their products.
We propose a system leveraging large language models (LLMs) to automatically map components from manufacturer Bills of Materials (BOMs) to Life Cycle Assessment (LCA) database entries.
 arXiv  Detail & Related papers  (2025-02-11T09:54:39Z)
- Unveiling User Satisfaction and Creator Productivity Trade-Offs in Recommendation Platforms [68.51708490104687]
 We show that a purely relevance-driven policy with low exploration strength boosts short-term user satisfaction but undermines the long-term richness of the content pool.
Our findings reveal a fundamental trade-off between immediate user satisfaction and overall content production on platforms.
 arXiv  Detail & Related papers  (2024-10-31T07:19:22Z)
- DiscoveryBench: Towards Data-Driven Discovery with Large Language Models [50.36636396660163]
 We present DiscoveryBench, the first comprehensive benchmark that formalizes the multi-step process of data-driven discovery.
Our benchmark contains 264 tasks collected across 6 diverse domains, such as sociology and engineering.
Our benchmark, thus, illustrates the challenges in autonomous data-driven discovery and serves as a valuable resource for the community to make progress.
 arXiv  Detail & Related papers  (2024-07-01T18:58:22Z)
- System Support for Environmentally Sustainable Computing in Data Centers [4.774769264608661]
 Modern data centers suffer from a growing carbon footprint due to insufficient support for environmental sustainability.
We present our preliminary results and recognize this as an ongoing initiative with significant potential to advance environmentally sustainable computing in data centers.
 arXiv  Detail & Related papers  (2024-03-19T12:56:02Z)
- Learn to Code Sustainably: An Empirical Study on LLM-based Green Code Generation [7.8273713434806345]
 We evaluate the sustainability of auto-generated code produced by commercial generative AI language models.
We compare the performance and green capacity of human-generated code and code generated by the three AI language models.
 arXiv  Detail & Related papers  (2024-03-05T22:12:01Z)
- Reliable, Adaptable, and Attributable Language Models with Retrieval [144.26890121729514]
 Parametric language models (LMs) are trained on vast amounts of web data.
They face practical challenges such as hallucinations, difficulty in adapting to new data distributions, and a lack of verifiability.
We advocate for retrieval-augmented LMs to replace parametric LMs as the next generation of LMs.
 arXiv  Detail & Related papers  (2024-03-05T18:22:33Z)
- Potentials of Green Coding -- Findings and Recommendations for Industry, Education and Science -- Extended Paper [0.0]
 We conduct an analysis to gather and present existing literature on three research questions relating to the production of ecologically sustainable software.
We compile the approaches to Green Coding and Green Software Engineering that have been published since 2010.
We consider ways to integrate the findings into existing industrial processes and higher education curricula to influence future development in an environmentally friendly way.
 arXiv  Detail & Related papers  (2024-02-28T10:48:56Z)
- EASRec: Elastic Architecture Search for Efficient Long-term Sequential Recommender Systems [82.76483989905961]
 Current Sequential Recommender Systems (SRSs) suffer from computational and resource inefficiencies.
We develop the Elastic Architecture Search for Efficient Long-term Sequential Recommender Systems (EASRec).
EASRec introduces data-aware gates that leverage historical information from input data batch to improve the performance of the recommendation network.
 arXiv  Detail & Related papers  (2024-02-01T07:22:52Z)
- GreenDB -- A Dataset and Benchmark for Extraction of Sustainability Information of Consumer Goods [58.31888171187044]
 We present GreenDB, a database that collects products from European online shops on a weekly basis.
As a proxy for the products' sustainability, it relies on sustainability labels, which are evaluated by experts.
We present initial results demonstrating that ML models trained with our data can reliably predict the sustainability label of products.
 arXiv  Detail & Related papers  (2022-07-21T19:59:42Z)
- Can Machine Learning Tools Support the Identification of Sustainable Design Leads From Product Reviews? Opportunities and Challenges [0.0]
 This paper aims to develop an integrated machine learning solution to obtain sustainable design insights from online product reviews automatically.
The opportunities and challenges offered by existing frameworks are discussed, illustrated, and positioned along an ad hoc machine learning process.
 arXiv  Detail & Related papers  (2021-12-17T08:53:58Z)
- SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine Learning [63.192289553021816]
 Progress toward the United Nations Sustainable Development Goals has been hindered by a lack of data on key environmental and socioeconomic indicators.
Recent advances in machine learning have made it possible to utilize abundant, frequently-updated, and globally available data, such as from satellites or social media.
In this paper, we introduce SustainBench, a collection of 15 benchmark tasks across 7 SDGs.
 arXiv  Detail & Related papers  (2021-11-08T18:59:04Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
       
     