My research focuses broadly on data-oriented systems and the way they drive computing. Recently this includes distributed programming models, serverless computing, distributed consistency and isolation, data management for machine learning and data science, interactive data visualization and transformation, and query processing.
My research is driven by collaborations with colleagues in a wide variety of fields including Programming Languages, Human-Computer Interaction, AI, Networking, Security, and Theoretical Computer Science.
Distributed Systems: The Hydro project is developing new techniques for the programmable cloud. Sub-projects include:
Data Management for Machine Learning: The machine learning lifecycle presents many data management problems.
Interactive Data Visualization: Data visualization systems merge language design, data processing and asynchronous event processing in service of human-centric data interaction. Current projects include:
BOOM and : Orders Of Magnitude simpler code for the Cloud.
BayesStore: Probabilistic data management
PIER: A peer-to-peer query engine based on distributed hash table (DHT) overlay technologies.
Telegraph: An Adaptive Dataflow System for networked data and services.
TinyDB: A query processing engine for ad-hoc wireless sensor networks.
Open Source Software