Timezone: »

 
SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine Learning
Christopher Yeh · Chenlin Meng · Sherrie Wang · Anne Driscoll · Erik Rozi · Patrick Liu · Jihyeon Lee · Marshall Burke · David Lobell · Stefano Ermon

Progress toward the United Nations Sustainable Development Goals (SDGs) has been hindered by a lack of data on key environmental and socioeconomic indicators, which historically have come from ground surveys with sparse temporal and spatial coverage. Recent advances in machine learning have made it possible to utilize abundant, frequently-updated, and globally available data, such as from satellites or social media, to provide insights into progress toward SDGs. Despite promising early results, approaches to using such data for SDG measurement thus far have largely evaluated on different datasets or used inconsistent evaluation metrics, making it hard to understand whether performance is improving and where additional research would be most fruitful. Furthermore, processing satellite and ground survey data requires domain knowledge that many in the machine learning community lack. In this paper, we introduce SustainBench, a collection of 15 benchmark tasks across 7 SDGs, including tasks related to economic development, agriculture, health, education, water and sanitation, climate action, and life on land. Datasets for 11 of the 15 tasks are released publicly for the first time. Our goals for SustainBench are to (1) lower the barriers to entry for the machine learning community to contribute to measuring and achieving the SDGs; (2) provide standard benchmarks for evaluating machine learning models on tasks across a variety of SDGs; and (3) encourage the development of novel machine learning methods where improved model performance facilitates progress towards the SDGs.

Author Information

Christopher Yeh (Caltech)

Christopher Yeh is a PhD student at Caltech studying computing and mathematical sciences, with a specialization in machine learning. His research focuses on computational sustainability and active learning.

Chenlin Meng (Stanford University)
Sherrie Wang (Stanford University)
Anne Driscoll (Stanford University)
Erik Rozi (Stanford University)
Patrick Liu (Stanford University)
Jihyeon Lee (Google Research, Stanford University)

Jihyeon is an incoming software engineer at Google Research. She graduated with a BS and MS in Computer Science from Stanford University, where she was a researcher in the Sustain Lab advised by Professor Stefano Ermon and in the Vision & Learning Lab advised by Professor Fei-Fei Li. She is interested in semi-supervised learning and AI methods to solve problems at the intersection of environment, policy, and people.

Marshall Burke (Stanford University)
David Lobell (Stanford University)
Stefano Ermon (Stanford)

More from the Same Authors