This page highlights several of my research and personal projects. For a complete list of my publications, see my publications page.

Research

SustainGym: Reinforcement Learning Environments for Sustainable Energy Systems

Caltech, 2022 - 2023

The lack of standardized benchmarks for reinforcement learning (RL) in sustainability applications has made it difficult to both track progress on specific domains and identify bottlenecks for researchers to focus their efforts. In this paper, we present SustainGym, a suite of five environments designed to test the performance of RL algorithms on realistic sustainable energy system tasks, ranging from electric vehicle charging to carbon-aware data center job scheduling. The environments test RL algorithms under realistic distribution shifts as well as in multi-agent settings. We show that standard off-the-shelf RL algorithms leave significant room for improving performance and highlight the challenges ahead for introducing RL to real-world sustainability tasks.

Collaborators: Victor Li, Rajeev Datta, Julio Arroyo Ibarra, Nicolas Christianson, Chi Zhang, Yize Chen, Mohammad Mehdi Hosseini, Azarang Golmohammadi, Yuanyuan Shi, Yisong Yue, Adam Wierman

Publications:

C. Yeh, V. Li, R. Datta, J. Arroyo, N. Christianson, C. Zhang, Y. Chen, M. Hosseini, A. Golmohammadi, Y. Shi, Y. Yue, and A. Wierman, “SustainGym: A Benchmark Suite of Reinforcement Learning for Sustainability Applications,” in Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track, New Orleans, LA, USA, Dec. 2023. [Online]. Available: https://openreview.net/forum?id=vZ9tA3o3hr.
C. Yeh, V. Li, R. Datta, Y. Yue, and A. Wierman, “SustainGym: A Benchmark Suite of Reinforcement Learning for Sustainability Applications,” in NeurIPS 2022 Workshop on Tackling Climate Change with Machine Learning, Dec. 2022. [Online]. Available: https://www.climatechange.ai/papers/neurips2022/38.

Paper (NeurIPS 2023) Project Page Code

Our voltage control method combines a consistent model chasing algorithm with a robust control oracle to achieve a finite-mistake guarantee even when the distribution grid topology is unknown.

Online learning for robust voltage control under uncertain grid topology

Caltech, 2022 - 2024

Voltage control generally requires accurate information about the grid’s topology in order to guarantee network stability. However, accurate topology identification is challenging for existing methods, especially as the grid is subject to increasingly frequent reconfiguration due to the adoption of renewable energy. In this work, we combine a nested convex body chasing algorithm with a robust predictive controller to achieve provably finite-time convergence to safe voltage limits in the online setting where there is uncertainty in both the network topology and the load and generation variations. In an online fashion, our algorithm narrows down the set of possible grid models that are consistent with observations and adjusts reactive power generation accordingly to keep voltages within desired safety limits. Our approach can also incorporate existing partial knowledge of the network to improve voltage control performance. We demonstrate the effectiveness of our approach in a case study on a Southern California Edison 56-bus distribution system. Our experiments show that in practical settings, the controller is indeed able to narrow the set of consistent topologies quickly enough to make control decisions that ensure stability in both linearized and realistic non-linear models of the distribution grid.
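The idea of narrowing down the set of grid models that are consistent with observations can be illustrated with a deliberately simplified sketch. It assumes a finite list of candidate linear models of the form v = X q + v_base and a fixed tolerance, whereas the work itself chases nested convex sets; the function name, matrices, and tolerance below are all hypothetical.

```python
import numpy as np

def prune_consistent_models(candidates, q, v_obs, v_base, tol):
    """Keep candidate matrices X whose prediction X @ q + v_base matches
    the observed voltages v_obs to within tol (max absolute error)."""
    return [X for X in candidates
            if np.max(np.abs(X @ q + v_base - v_obs)) <= tol]

# Hypothetical 2-bus example: three candidate models, one of them the true one.
X_true = np.array([[0.10, 0.05], [0.05, 0.20]])
candidates = [
    X_true,
    np.array([[0.10, 0.00], [0.00, 0.20]]),
    np.array([[0.30, 0.05], [0.05, 0.20]]),
]
v_base = np.array([1.0, 1.0])

# Each new observation removes the candidates it contradicts.
sizes = []
for q in (np.array([0.0, 0.5]), np.array([0.5, 0.0])):
    v_obs = X_true @ q + v_base  # noiseless observation from the true model
    candidates = prune_consistent_models(candidates, q, v_obs, v_base, tol=1e-3)
    sizes.append(len(candidates))
```

In this toy run the candidate list shrinks from three models to two and then to one, which is the true model. A controller would act robustly against every model still in the list at each step.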

Collaborators: Jing Yu, Yuanyuan Shi, Adam Wierman

Publications:

C. Yeh, J. Yu, Y. Shi, and A. Wierman, “Online learning for robust voltage control under uncertain grid topology,” IEEE Transactions on Smart Grid, vol. 15, no. 5, pp. 4754–4764, Sep. 2024, ISSN: 1949-3061. DOI: 10.1109/TSG.2024.3383804. [Online]. Available: https://ieeexplore.ieee.org/document/10486962.
C. Yeh, J. Yu, Y. Shi, and A. Wierman, “Robust online voltage control with an unknown grid topology,” in Proceedings of the Thirteenth ACM International Conference on Future Energy Systems (e-Energy ’22), Association for Computing Machinery, Jun. 2022, pp. 240–250, ISBN: 9781450393973. DOI: 10.1145/3538637.3538853. [Online]. Available: https://dl.acm.org/doi/10.1145/3538637.3538853.

Paper (IEEE Transactions on Smart Grid) Presentation arXiv Code

SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine Learning

Caltech, 2020 - 2021

Progress toward the United Nations Sustainable Development Goals (SDGs) has been hindered by a lack of data on key environmental and socioeconomic indicators, which historically have come from ground surveys with sparse temporal and spatial coverage. Recent advances in machine learning have made it possible to utilize abundant, frequently-updated, and globally available data, such as from satellites or social media, to provide insights into progress toward SDGs. Despite promising early results, approaches to using such data for SDG measurement thus far have largely evaluated on different datasets or used inconsistent evaluation metrics, making it hard to understand whether performance is improving and where additional research would be most fruitful. Furthermore, processing satellite and ground survey data requires domain knowledge that many in the machine learning community lack. In this paper, we introduce SustainBench, a collection of 15 benchmark tasks across 7 SDGs, including tasks related to economic development, agriculture, health, education, water and sanitation, climate action, and life on land. Datasets for 11 of the 15 tasks are released publicly for the first time. Our goals for SustainBench are to (1) lower the barriers to entry for the machine learning community to contribute to measuring and achieving the SDGs; (2) provide standard benchmarks for evaluating machine learning models on tasks across a variety of SDGs; and (3) encourage the development of novel machine learning methods where improved model performance facilitates progress towards the SDGs.

Mentors: Prof. Stefano Ermon, Prof. Marshall Burke, Prof. David Lobell

Collaborators: Chenlin Meng, Sherrie Wang, Anne Driscoll, Erik Rozi, Patrick Liu, Jihyeon Lee

Publication: C. Yeh, C. Meng, S. Wang, A. Driscoll, E. Rozi, P. Liu, J. Lee, M. Burke, D. B. Lobell, and S. Ermon, “SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine Learning,” in Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks, Dec. 2021. [Online]. Available: https://datasets-benchmarks-proceedings.neurips.cc/paper_files/paper/2021/hash/950a4152c2b4aa3ad78bdd6b366cc179-Abstract-round2.html.

Paper (NeurIPS 2021) Presentation Project Page Code

In both active learning and core-set selection settings, our “selection via proxy” approach uses a smaller proxy model to select data for a larger target model.

Selection via Proxy: Efficient Data Selection for Deep Learning

Stanford University, Summer 2018 - 2019

Data selection methods, such as active learning and core-set selection, are useful tools for machine learning on large datasets. However, they can be prohibitively expensive to apply in deep learning because they depend on feature representations that need to be learned. We show that we can greatly improve the computational efficiency by using a small proxy model to perform data selection (e.g., selecting data points to label for active learning). By removing hidden layers from the target model, using smaller architectures, and training for fewer epochs, we create proxies that are an order of magnitude faster to train. Although these small proxy models have higher error rates, we find that they empirically provide useful signals for data selection. We evaluate this “selection via proxy” (SVP) approach on several data selection tasks across five datasets: CIFAR10, CIFAR100, ImageNet, Amazon Review Polarity, and Amazon Review Full. For active learning, applying SVP can give an order of magnitude improvement in data selection runtime (i.e., the time it takes to repeatedly train and select points) without significantly increasing the final error (often within 0.1%). For core-set selection on CIFAR10, proxies that are over 10x faster to train than their larger, more accurate targets can remove up to 50% of the data without harming the final accuracy of the target, leading to a 1.6x end-to-end training time improvement.
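As a rough illustration of using a proxy's outputs to choose points to label, the sketch below ranks an unlabeled pool by the proxy's confidence and picks the least confident points. The least-confidence criterion, the function name, and the probability values are assumptions made for this example, not a description of the released code.

```python
import numpy as np

def select_least_confident(proxy_probs, k):
    """Return indices of the k pool points whose top predicted class
    probability under the proxy model is lowest."""
    confidence = proxy_probs.max(axis=1)
    return np.argsort(confidence, kind="stable")[:k]

# Hypothetical proxy outputs for a pool of 4 unlabeled points and 3 classes.
proxy_probs = np.array([
    [0.90, 0.05, 0.05],
    [0.40, 0.35, 0.25],
    [0.60, 0.30, 0.10],
    [0.34, 0.33, 0.33],
])
to_label = select_least_confident(proxy_probs, k=2)
```

The selected points would then be labeled and used to train the larger target model, so the expensive model is never needed during the selection loop itself.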

Mentors: Prof. Matei Zaharia, Prof. Peter Bailis, Prof. Jure Leskovec, Prof. Percy Liang

Collaborators: Cody Coleman, Stephen Mussmann, Baharan Mirzasoleiman

Publications:

C. Coleman, C. Yeh, S. Mussmann, B. Mirzasoleiman, P. Bailis, P. Liang, J. Leskovec, and M. Zaharia, “Selection via Proxy: Efficient Data Selection for Deep Learning,” in International Conference on Learning Representations, Apr. 2020. [Online]. Available: https://openreview.net/forum?id=HJg2b0VYDr.
C. Coleman, C. Yeh, S. Mussmann, B. Mirzasoleiman, P. Bailis, P. Liang, J. Leskovec, and M. Zaharia, “Selection via Proxy: Increasing the Computational Efficiency of Deep Active Learning,” in Practical Machine Learning for Developing Countries Workshop at ICLR 2020, Apr. 2020. [Online]. Available: https://pml4dc.github.io/iclr2020/program/pml4dc_25.html.

Paper (ICLR 2020) Presentation Blog Post Code

Nighttime light satellite imagery provides valuable information about economic development.

Deep learning for understanding economic well-being in Africa from publicly available satellite imagery

Stanford University, Sustainability and AI Lab, Summer 2017 - 2020

Accurate and comprehensive measurements of economic well-being are fundamental inputs into both research and policy, but such measures are unavailable at a local level in many parts of the world. We train deep learning models to predict survey-based estimates of asset wealth across ~20,000 African villages from publicly-available multispectral daytime and nightlight satellite imagery with broad temporal and spatial coverage. Models are able to explain 70% of the variation in ground-measured village wealth in countries where the model was not trained, outperforming previous benchmarks from high-resolution imagery. Comparison with independent wealth measurements from censuses suggests that errors in satellite estimates are comparable to errors in existing ground data. Validating estimates of temporal changes in wealth across ~1,500 villages is also hampered by noise in training data, but district-aggregated satellite-based estimates explain up to 50% of the variation in ground-estimated changes in wealth over time, with daytime imagery particularly useful in this task. We quantitatively demonstrate the utility of satellite-based estimates for research and policy, and demonstrate their scalability by creating a wealth map for Africa’s most populous country.

Mentors: Prof. Stefano Ermon, Prof. Marshall Burke, Prof. David Lobell

Collaborators: Anthony Perez, Anne Driscoll, George Azzari, Zhongyi Tang

Publications:

A. Perez, C. Yeh, G. Azzari, M. Burke, D. Lobell, and S. Ermon, “Poverty Prediction with Public Landsat 7 Satellite Imagery and Machine Learning,” in NIPS 2017 Workshop on Machine Learning for the Developing World, Long Beach, CA, USA, Dec. 2017. arXiv:1711.03654. [Online]. Available: https://arxiv.org/abs/1711.03654.
C. Yeh, A. Perez, A. Driscoll, G. Azzari, Z. Tang, D. Lobell, S. Ermon, and M. Burke, “Using publicly available satellite imagery and deep learning to understand economic well-being in Africa,” Nature Communications, vol. 11, no. 1, May 2020, ISSN: 2041-1723. DOI: 10.1038/s41467-020-16185-w. [Online]. Available: https://www.nature.com/articles/s41467-020-16185-w.
C. Yeh, A. Perez, A. Driscoll, G. Azzari, Z. Tang, D. Lobell, S. Ermon, and M. Burke, “Deep learning for understanding economic well-being in Africa from publicly available satellite imagery,” in Workshop on Machine Learning for Economic Policy at NeurIPS 2020, Dec. 2020. [Online]. Available: http://www.mlforeconomicpolicy.com/papers/MLEconPolicy20_paper_30.pdf.

Paper (Nature Communications, May 2020) Poster (NeurIPS 2020 Workshop) Code

Estimated stereo disparity map from our model on the “Art” dataset from the Middlebury Stereo Vision Page.

Conditional Random Fields for Dense Stereo Matching

UC Irvine, Summer 2012 - Summer 2014

Various algorithms have been developed over the past two decades for solving the stereo correspondence problem, which is defined as the identification of the offset or disparity of an object in a pair of stereo images. Recent work has shown that conditional random fields (CRFs) have the potential to be faster and more accurate than traditional local matching algorithms. The canonical CRF for solving dense stereo matching problems uses a basic energy function that accounts for both local intensity matching and smoothness costs. Traditionally, the smoothness term relies on a binary Potts Model which fails to assign different costs to different disparities. In this paper, we extend the smoothness term in the energy function to be more robust. Specifically, we explore using a logarithmic function modulated by discrete edge gradient bins and binary edge detection features. The logarithmic function is able to distinguish between different disparities and therefore assign more appropriate costs. Our results suggest that our algorithm exceeds the performance of the traditional smoothness term based on a Potts Model. However, further optimization in our CRF evaluation process is necessary to achieve real-time outputs.
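The difference between the two smoothness terms can be seen in a small sketch. The form log(1 + |d_p - d_q|) and the scalar `weight` (standing in for the edge-based modulation) are assumptions chosen for illustration; the exact function used in the project is not given here.

```python
import math

def potts_cost(d_p, d_q, penalty=1.0):
    """Binary Potts smoothness: a flat penalty whenever neighboring
    disparities differ, regardless of how much they differ."""
    return 0.0 if d_p == d_q else penalty

def log_cost(d_p, d_q, weight=1.0):
    """Logarithmic smoothness: the cost grows with the disparity gap.
    `weight` is a hypothetical stand-in for edge-dependent modulation."""
    return weight * math.log1p(abs(d_p - d_q))

# Neighboring pixels with a small versus a large disparity gap.
small_gap = (potts_cost(3, 4), log_cost(3, 4))
large_gap = (potts_cost(3, 10), log_cost(3, 10))
```

Under the Potts model both gaps receive the same penalty, while the logarithmic cost penalizes the larger gap more, with growth that flattens out for very large jumps such as those at object boundaries.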

Mentor: Prof. Alex Ihler

Presented at the 2013 Southern California Conference for Undergraduate Research (SCCUR) at Whittier College, CA.

Slides Presentation

Foam fractionation of a dilute solution of bovine lactoferrin into a glass beaker.

Effect of Aging on the Foam Fractionation of Lactoferrin

Caltech, Summer 2011

Foam fractionation is an inexpensive and simple technique for concentrating proteins. The foamability of a protein can change drastically as the protein ages. The foamability of solutions prepared from ten-year-old bovine lactoferrin (bLF) was investigated while varying the protein concentration, air flow velocity, and solution pH. The results suggest that the foamability of the aged protein decreased to an insignificant level except at high pH with a protein concentration of 0.1 mg/mL.

Mentors: Prof. Robert Tanner, Prof. Julia Kornfield

Collaborators: Benjamin Yeh, Yuehan Huang

Presented at the 43rd American Chemical Society Western Regional Meeting, Pasadena, CA.
