Agenda

Wednesday June 17 - Plenary Sessions

All times Eastern Daylight Time (EDT) (UTC−04:00)
 
9:45am - 10:00am
Coffee via Zoom
 
10:00am - 10:10am
Welcome
Mercè Crosas (Chief Data Science and Technology Officer, IQSS)
 
10:10am – 10:30am
Global COVID-19 Data Sharing
Short presentations of some of the popular datasets on COVID-19 related data shared publicly using Dataverse.
European COVID-19 Datasets - Vyacheslav Tykhonov (Senior Data Scientist, DANS)
China Data Lab - Resources for COVID-19 – Wendy Guan (Executive Director, Center for Geographic Analysis, Harvard University)
COVID-19 data from Population Council - Thoai Ngo (Director, Poverty, Gender, and Youth program, Population Council)
 
10:30am - 11:30am
Panel on Improving and Standardizing COVID-19 Data Sharing
This panel will include recommendations on sharing COVID-19 data from three perspectives: from the Research Data Alliance community, from the National Institutes of Health, and from researchers who work on understanding the effectiveness of interventions.
Hilary Hanahoe (Secretary General Research Data Alliance): Recommendations from RDA working group
Susan Gregurick (Director of the Office of Data Science Strategy): Recommendations from NIH
Caroline Buckee and Satchit Balsari (Harvard School of Public Health); Recommendations and needs from researchers
 
11:30am – 11:45am
Break
 
11:45am – 12:15pm
Dataverse New Features and Future Plans
 
12:15pm - 1:15pm
Lightning Video presentations from Dataverse Repositories World-wide
 
1:15pm - 1:30pm
#Dataverse2020: What's Ahead on Thursday and Friday

Thursday June 18 - Breakout Sessions

All times Eastern Daylight Time (EDT) (UTC−04:00)

8:00am - 9:30am
Core Trust Seal 
Ceilyn Boyd (Harvard Library, Harvard University)
Philipp Conzett (Senior Research Librarian, UiT The Arctic University of Norway)
Sebastian Karcher (Associate Director, Qualitative Data Repository)
Sonia Barbosa (IQSS, Harvard University)
The Core Trust Seal (CTS, https://www.coretrustseal.org/) is the most widely used certification for data repositories. CTS is made up of a peer-reviewed self-assessment of a repository’s facilities, organization, and policies. Several of CTS’s requirements can be met based on Dataverse’s existing capabilities. Moreover, a number of repositories in the Dataverse community have already obtained the certification and can share their experience. Whether you are considering applying for CTS certification or whether you’ve already started the process, this session is for you!

8:00am - 9:30am
GDCC/Governance Session 2
Mercè Crosas (Chief Data Science and Technology Officer, IQSS)
Jonathan Crabtree (Assistant Director of Cyberinfrastructure, Odum Institute UNC Chapel Hill)
The session is open to all existing Global Dataverse Community Consortium (GDCC) members and others interested in becoming a member or learning more about GDCC. It will address the following topics:
•    Update on DataCite new fees model and how it affects some GDCC members
•    Changes in GDCC membership from fiscal year to calendar year
•    Governance of GDCC and community contributions, including the proposal for a new external advisory board
•    Current efforts in community collaboration projects
•    Community members input for prioritizing next new services
•    Q&A

Two identical sessions will be provided in order to give an opportunity for all GDCC members to participate. The other session is offered on Wednesday at 6:00pm EDT.

9:45am - 11:15am
Sensitive Data/DataTags
Merce Crosas (Harvard IQSS)
Marion Wittenberg (DANS)
Tania Schlatter (Harvard IQSS)
This session will introduce how the Dataverse software will support sensitive data, in terms of functionality and tools, and will present a broad range of requirements from the community in the form of use cases. 

11:30am - 1:00pm
DevOps/Installation Deployments/Automated Testing
Don Sizemore (Odum Institute)
Slava Tykhonov (DANS/DataverseEU)
Over the last several years, the Dataverse Project has seen a number of new initiatives in the areas of DevOps, Installation Deployments, and Automated Testing. Attendees will discuss how the Dataverse Community currently uses technologies such as Vagrant, Ansible, Docker, Kubernetes, Jenkins, and more to help with the development process and running Dataverse in production and non-production environments. After a review of the history and current processes around these tools, we’ll work collaboratively to identify new directions, technologies, and use cases to support Dataverse into the future.

1:15pm - 2:45pm 
Geospatial Data
Eugene Barsky (The University of British Columbia)
Bonnie Burns (Harvard Geospatial Library)
BFFs – Dataverse and Geospatial” – this session will bring together those working with Dataverse repository services for geospatial data as well as those exploring the possibility. The main focus is to share best practices, solve common problems and address technical issues with integrating geospatial data into a Dataverse repository and associated services. We will hear from developers of the Canadian Geodisy project, contributors to the Harvard Geospatial Library, Dataverse developers, and many others.  Please join us to learn more about how Dataverse can help you store and serve your geospatial data now, and provide valuable feedback to the Dataverse team as they move forward with new services.

3:00pm - 4:30pm
Encapsulation
Ana Trisovic (IQSS, Harvard University)
Craig Willis (WholeTale)
Sandra Gesing (University of Notre Dame)
In this session, we will have a discussion on what is an adequate and sustainable way to deposit and store virtual containers and computational workflows. This is a complicated issue because, on the one hand, it is not economical to store whole Docker images in Dataverse, and on the other hand, Dockerfiles are prone to errors. We will use recent development on Dataverse described in https://arxiv.org/abs/2005.02985 as a starting point for this session.

4:45pm - 5:45pm
New Feature Demo
Sonia Barbosa (IQSS, Harvard University)
Danny Brooke (IQSS, Harvard University)
Come and learn about the Dataverse releases and features from over the last year! 

6:00pm - 7:00pm
Flexible Metadata Session 1
Jim Myers (Global Dataverse Community Consortium)
Katie Mika (IQSS, Harvard University)
Steve McEachern (Australian Data Archive)
Dataverse has a rich and extensible set of metadata capabilities and many members of our community are interested in how to best leverage these capabilities to support researchers and in further expanding Dataverse’s metadata capabilities. This session will be separated into two, hour long breakouts to provide an opportunity to discuss what is currently possible, to learn about the range of potential new features being proposed, and to engage with others to help identify priorities and discuss how these ideas can be integrated into a coherent design and roadmap for long term development.
Session One (this session): Panelists will give brief talks on metadata projects that have been recently implemented or are near-term opportunities. This discussion will focus on near-term metadata projects and possibilities that can be executed in the latest version. 

Session Two: Panel of speakers will give brief talks on longer term metadata issues and projects that have major design implications. Discussion in this session will focus on opportunities, challenges, and priorities for long term features and design development to facilitate metadata integration and exchange. Offered on Friday at 8:30am EDT

Friday June 19 - Breakout Sessions

All times Eastern Daylight Time (EDT) (UTC−04:00)

8:30am - 9:30am
Flexible Metadata Session 2
Jim Myers (Global Dataverse Community Consortium)
Katie Mika (IQSS, Harvard University)
Steve McEachern (Australian Data Archive)
Dataverse has a rich and extensible set of metadata capabilities and many members of our community are interested in how to best leverage these capabilities to support researchers and in further expanding Dataverse’s metadata capabilities. This session will be separated into two, hour long breakouts to provide an opportunity to discuss what is currently possible, to learn about the range of potential new features being proposed, and to engage with others to help identify priorities and discuss how these ideas can be integrated into a coherent design and roadmap for long term development.
Session One: Panelists will give brief talks on metadata projects that have been recently implemented or are near-term opportunities. This discussion will focus on near-term metadata projects and possibilities that can be executed in the latest version. Offered on Thursday at 6:00pm EDT.
Session Two (this session): Panel of speakers will give brif talks on longer term metadata issues and projects that have major design implications. Discussion in this session will focus on opportunities, challenges, and priorities for long term features and design development to facilitate metadata integration and exchange.

9:45am - 11:15am
External Tools
Kaitlin Newson (Scholars Portal)
Phil Durbin (IQSS, Harvard University)
 The external tools framework enables developers to create tools that connect to Dataverse to preview, explore, and curate datasets. In this session, we’ll review how external tools work in Dataverse, learn about external tools that have been developed by the community, and discuss ways we might enhance and expand on this powerful Dataverse feature.

11:30am - 1pm
Remote Storage/Large Datasets
Anna Dabrowski (Texas Advanced Computing Center)
Meghan Goodchild (Scholars Portal)
Gustavo Durand (IQSS, Harvard University)
 The Remote Storage/Large Datasets breakout session will include brief presentations from community members and group discussion about improving data transfer mechanisms and storage options for Dataverse repositories. Presentations will cover data upload developments, including Globus integration and direct upload to S3, as well as Trusted Remote Storage Agents (TRSA) for sensitive data, and managing multiple data stores. This session is an opportunity to hear about and provide feedback on the efforts of community members, discuss policy and service implications for managing repositories, and talk about further coordinating our efforts.

1:15pm - 1:45pm
Closing Session/Social Event
 

Wednesday June 17 - Breakout Session

All times Eastern Daylight Time (EDT) (UTC−04:00)

6:00pm - 7:30pm
GDCC/Governance Session 1
Mercè Crosas (Chief Data Science and Technology Officer, IQSS)
Jonathan Crabtree (Assistant Director of Cyberinfrastructure, Odum Institute UNC Chapel Hill)
The session is open to all existing Global Dataverse Community Consortium (GDCC) members and others interested in becoming a member or learning more about GDCC. It will address the following topics:
·   Update on DataCite new fees model and how it affects some GDCC members
·   Changes in GDCC membership from fiscal year to calendar year
·   Governance of GDCC and community contributions, including the proposal for a new external advisory board
·   Current efforts in community collaboration projects
·   Community members input for prioritizing next new services
·   Q&A

Two identical sessions will be provided in order to give an opportunity for all GDCC members to participate. The other session is offered on Thursday at 8:00am EDT.