OpenDP Community Meeting 2020

 

On May 13th - 15th, 2020, we held an online workshop to share detailed plans for OpenDP and obtain community feedback on them.  We covered topics such as the programming framework, governance, system integrations, use cases, statistical functionality, and collaborations.  The content of the following documents were discussed during this workshop:

OpenDP Whitepaper, May 2020

OpenDP Programming Framework, May 2020

Overview of OpenDP and Goals for Meeting

Speakers: Gary King, Open DP Faculty Director and Albert J. Weatherhead III University Professor, Director, IQSS, Harvard University

Salil Vadhan, OpenDP Faculty Director and Vicky Joseph Professor of Computer Science & Applied Mathematics, Harvard University

Annie Wu, Program Director for the Privacy Tools Project and OpenDP

We will give an overview of the plans and timeline for OpenDP and the goals for the workshop.

Slides

Use Cases

Speakers: Merce Crosas, University Research Data Management Officer, HUIT, Chief Data Science and Technology Officer, IQSS, Harvard University

Ilya Mironov, Research Scientist, Facebook AI

The goal of OpenDP is to help open up otherwise siloed and sequestered sensitive data to support scientifically oriented research and exploration in the public interest. To this end, we will discuss several concrete use cases for OpenDP, which include archival data repositories, government agencies, and companies sharing data with researchers.

Slides

Differential Privacy for COVID-19 Data

Speakers: Miguel Guevara, Product Manager, Google

Bryant Gipson, Engineering Manager, Google

We will give an overview of the three year journey and DP infrastructure that made it possible to launch Google's first-ever DP product (from the ground-up) in record time to address the Covid-19 crisis. We first present an overview of the reports, and continue by showing the different building blocks that are critical aspects of a robust DP infrastructure. We conclude with some insights and learnings we obtained.

Programming Framework for the DP Library

Speakers: Marco Gaboardi, Assistant Professor, Computer Science, Boston University

Michael Hay, Associate Professor, Computer Science, Colgate University

At the core of the OpenDP Commons will be a library of differentially private algorithms that will be contributed and vetted by the OpenDP community. We will propose a programming framework for this library, and discuss how it seeks to support extensibility, flexibility, programmability, modularity, usability, efficiency, and utility.

 

Slides

Statistical Functionality

Speaker: James Honaker, Chief Privacy Engineer and Research Associate, Privacy Tools Project and OpenDP, Harvard University

We will discuss what statistical methods and measures of statistical utility and validity we plan to prioritize and support in OpenDP. We will also cover what library and system components and features make for easier adoption by statisticians, data scientists and other applied analysts.

Slides

System Integrations

Speakers: James Honaker, Chief Privacy Engineer and Research Associate, Privacy Tools Project and OpenDP, Harvard University

Joshua Allen, Principal Data Scientist, Microsoft

For differential privacy software to be useful in applications, it must integrate with end-to-end systems, which may vary in their back-end storage and compute capabilities and in the needs and expertise of end users. We will give several examples of the kinds of systems we aim to support with OpenDP, including a system being developed in collaboration with Microsoft."

Slides

Governance and Licensing

Speakers: Chris Bavitz, WilmerHale Clinical Professor of Law, Harvard Law School and Managing Director, Cyberlaw Clinic at the Berkman Klein Center for Internet and Society

Annie Wu, Program Director for the Privacy Tools Project and OpenDP

We will describe our plans for the governance of OpenDP, including both community vetting of the software in the OpenDP Commons as well as a future transition to community governance of the project as a whole. Other issues to be discussed include the code of conduct and licensing.

Slides

Collaborations

Speakers: Merce Crosas, University Research Data Management Officer, HUIT, Chief Data Science and Technology Officer, IQSS, Harvard University

Sarah Bird, Principal Program Manager, Microsoft

We aim for OpenDP to have a wide array of partners in industry, government, and academia, including organizations that will contribute to or make use of the OpenDP software. We will discuss several models for how we can carry out such collaborations.

Slides

 

The Programming Framework for the DP Library

Discussion Leads:

Marco Gaboardi, Assistant Professor, Computer Science, Boston University

Michael Hay, Associate Professor, Computer Science, Colgate University

This breakout group will discuss the programming framework for the DP library as presented in the plenary session and detailed in a whitepaper to be distributed before the workshop. We seek to obtain feedback from DP researchers and software engineers who may contribute to the DP library, and those who have experience with algorithmic libraries in other software projects.