SustainBench: Benchmarks for Monitoring the Sustainable Development
  Goals with Machine Learning
        - URL: http://arxiv.org/abs/2111.04724v1
- Date: Mon, 8 Nov 2021 18:59:04 GMT
- Title: SustainBench: Benchmarks for Monitoring the Sustainable Development
  Goals with Machine Learning
- Authors: Christopher Yeh, Chenlin Meng, Sherrie Wang, Anne Driscoll, Erik Rozi,
  Patrick Liu, Jihyeon Lee, Marshall Burke, David B. Lobell, Stefano Ermon
- Abstract summary: Progress toward the United Nations Sustainable Development Goals has been hindered by a lack of data on key environmental and socioeconomic indicators.
Recent advances in machine learning have made it possible to utilize abundant, frequently-updated, and globally available data, such as from satellites or social media.
In this paper, we introduce SustainBench, a collection of 15 benchmark tasks across 7 SDGs.
- Score: 63.192289553021816
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Progress toward the United Nations Sustainable Development Goals (SDGs) has
been hindered by a lack of data on key environmental and socioeconomic
indicators, which historically have come from ground surveys with sparse
temporal and spatial coverage. Recent advances in machine learning have made it
possible to utilize abundant, frequently-updated, and globally available data,
such as from satellites or social media, to provide insights into progress
toward SDGs. Despite promising early results, approaches to using such data for
SDG measurement thus far have largely evaluated on different datasets or used
inconsistent evaluation metrics, making it hard to understand whether
performance is improving and where additional research would be most fruitful.
Furthermore, processing satellite and ground survey data requires domain
knowledge that many in the machine learning community lack. In this paper, we
introduce SustainBench, a collection of 15 benchmark tasks across 7 SDGs,
including tasks related to economic development, agriculture, health,
education, water and sanitation, climate action, and life on land. Datasets for
11 of the 15 tasks are released publicly for the first time. Our goals for
SustainBench are to (1) lower the barriers to entry for the machine learning
community to contribute to measuring and achieving the SDGs; (2) provide
standard benchmarks for evaluating machine learning models on tasks across a
variety of SDGs; and (3) encourage the development of novel machine learning
methods where improved model performance facilitates progress towards the SDGs.
 
      
        Related papers
        - Toward Generalizable Evaluation in the LLM Era: A Survey Beyond   Benchmarks [229.73714829399802]
 This survey probes the core challenges that the rise of Large Language Models poses for evaluation.
We identify and analyze two pivotal transitions: (i) from task-specific to capability-based evaluation, which reorganizes benchmarks around core competencies such as knowledge, reasoning, instruction following, multi-modal understanding, and safety.
We will dissect this issue, along with the core challenges of the above two transitions, from the perspectives of methods, datasets, evaluators, and metrics.
 arXiv  Detail & Related papers  (2025-04-26T07:48:52Z)
- G-OSR: A Comprehensive Benchmark for Graph Open-Set Recognition [54.45837774534411]
 We introduce textbfG-OSR, a benchmark for evaluating Graph Open-Set Recognition (GOSR) methods at both the node and graph levels.
Results offer critical insights into the generalizability and limitations of current GOSR methods.
 arXiv  Detail & Related papers  (2025-03-01T13:02:47Z)
- Sustainable Visions: Unsupervised Machine Learning Insights on Global   Development Goals [0.3764231189632788]
 The United Nations 2030 Agenda for Sustainable Development outlines 17 goals to address global challenges.
Progress toward the SDGs is heavily influenced by geographical, cultural and socioeconomic factors.
No country on track to achieve all goals by 2030.
 arXiv  Detail & Related papers  (2024-09-19T03:10:49Z)
- Leveraging Artificial Intelligence Technology for Mapping Research to
  Sustainable Development Goals: A Case Study [6.551575555269426]
 This study employed over 82,000 publications from an Australian university as a case study.
We utilized a similarity measure to map these publications onto Sustainable Development Goals.
We leveraged the OpenAI GPT model to conduct the same task, facilitating a comparative analysis between the two approaches.
 arXiv  Detail & Related papers  (2023-11-09T11:44:22Z)
- Harnessing the Web and Knowledge Graphs for Automated Impact Investing
  Scoring [2.4107880640624706]
 We describe a data-driven system that seeks to automate the process of creating an Sustainable Development Goals framework.
We propose a novel method for collecting and filtering a dataset of texts from different web sources and a knowledge graph relevant to a set of companies.
Our results indicate that our best performing model can accurately predict SDG scores with a micro average F1 score of 0.89.
 arXiv  Detail & Related papers  (2023-08-04T15:14:16Z)
- GEO-Bench: Toward Foundation Models for Earth Monitoring [139.77907168809085]
 We propose a benchmark comprised of six classification and six segmentation tasks.
This benchmark will be a driver of progress across a variety of Earth monitoring tasks.
 arXiv  Detail & Related papers  (2023-06-06T16:16:05Z)
- DataPerf: Benchmarks for Data-Centric AI Development [81.03754002516862]
 DataPerf is a community-led benchmark suite for evaluating ML datasets and data-centric algorithms.
We provide an open, online platform with multiple rounds of challenges to support this iterative development.
The benchmarks, online evaluation platform, and baseline implementations are open source.
 arXiv  Detail & Related papers  (2022-07-20T17:47:54Z)
- Benchmarking high-fidelity pedestrian tracking systems for research,
  real-time monitoring and crowd control [55.41644538483948]
 High-fidelity pedestrian tracking in real-life conditions has been an important tool in fundamental crowd dynamics research.
As this technology advances, it is becoming increasingly useful also in society.
To successfully employ pedestrian tracking techniques in research and technology, it is crucial to validate and benchmark them for accuracy.
We present and discuss a benchmark suite, towards an open standard in the community, for privacy-respectful pedestrian tracking techniques.
 arXiv  Detail & Related papers  (2021-08-26T11:45:26Z)
- Seeing poverty from space, how much can it be tuned? [0.0]
 We demonstrate that individuals with no organizational affiliation can participate in the improvement of predicting local poverty levels in a given agro-ecological environment.
The approach builds upon several pioneering efforts related to mapping poverty by deep learning to process satellite imagery and "ground-truth" data from the field.
A key goal of the project was to intentionally keep costs as low as possible - by using freely available resources - so that citizen scientists, students and organizations could replicate the method in other areas of interest.
 arXiv  Detail & Related papers  (2021-07-30T15:23:54Z)
- DAGA: Data Augmentation with a Generation Approach for Low-resource
  Tagging Tasks [88.62288327934499]
 We propose a novel augmentation method with language models trained on the linearized labeled sentences.
Our method is applicable to both supervised and semi-supervised settings.
 arXiv  Detail & Related papers  (2020-11-03T07:49:15Z)
- Using satellite imagery to understand and promote sustainable
  development [87.72561825617062]
 We synthesize the growing literature that uses satellite imagery to understand sustainable development outcomes.
We quantify the paucity of ground data on key human-related outcomes and the growing abundance and resolution of satellite imagery.
We review recent machine learning approaches to model-building in the context of scarce and noisy training data.
 arXiv  Detail & Related papers  (2020-09-23T05:20:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.